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OM nucleic - nucleic search, using sw model 
Run OR: 



January 6, 2004, 01:15:17 ; Search time 2533 Seconds 

(without alignments) 
9541.130 Million cell updates/s 



T i t le: US-10-088-872-1 

Squence SCOre ' "^aaaaaaatgcctttgtt tgaagaaaacggccccttga 1014 

Scoring tabic:: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 22781392 seqs , 12152238C56 residues . . 

Total number of hits satisfying chosen parameters: 45562784 

Minimum DB seq length: 0 

maximum DB seq length: 2000000000 

"*oi-:r.. processing : Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



EST:* 





em_estba : * 




2 : 


em esthum:* 




> : 


im^cstin: * 




4 : 


em estmu : * 




5 : 


em__estov: * 




o : 


em_estpl : * 




7 : 


em estro:* 




8 : 


em htc : * 




9: 


gb_estl:* 




10 : 


gb_est2 : * 




11 : 


gb htc : * 




12 : 


gb_est3 : * 




13 : 


gb_est4 : * 




14 : 


gb_est5 : * 




15 : 


em__est fun . * 




15 : 


em estom : * 




17 : 


em gss_hum: 


* 


18 : 


em_gss_inv: 




19: 


em__gss_pln : 


* 


20 : 


em gss_vrt : 


* 


21 : 


em gss__fun: 


* 


22: 


em gss_mam: 


* 


23 : 


em gss_ mus : 


* 


24 : 


em gss_pro: 


-k 


25 : 


em gss_rod: 


* 


26: 


em_gss_phg : 


* 


27 : 


em_gss_vrl 





28: gb_gssl:* 
29: gb_gss2:* 



Pred No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and jp derived by analysis of the total score distribution. 
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AK07 686 7 Mus muscu 
AK03 0474 Mus muscu. 
AK053642 Mus muscu 
AK076758 Musmuscu 
AK0132 05 Mus muscu 
AU125107 AU125107. 
BX393735 BX393735 
AK00532 3 Mus muscu 
BG21873 5 RST3 84 7.6 
AK013161 Mus muscu 
BU116522 603139786 ' 
BQ6 6 9953 AGENCO URT 
BU518 807 AGENCOURT 
CD354831 UI-M-GIW- 
Bx4 83 012 Homo sapi 
CA97 3 07 8 AGENCOURT - 
BU5 14 92 0 AGENCOURT 
AA278473 zs81hl2.X 
CA982 682 AGENCOURT 
CD10 3 801 AGENCOURT 
BF159587 60.17690.84 
AW242839 xn26f05.X 
BU914186 AGENCOURT 
BQ8 99617 AGENCOURT 
CBS 5 9060 AGENCOURT 
BU367571 6037877.11 
BQ94 0 64 7 AGENCOURT 
CB2 00465 AGENCOURT 
BQ021348 UI-H-DH1- 
CA983 0 82 AGENCOURT 
BI499153 ie27c05 .y 
BU9052 64 AGENCOURT 
BI655225 603284039 
BQ8 802 00 AGENCOURT 
BU152 3 76 AGENCOURT 
AI645170 mt82a02.y 
AL961440 AL961440 
CBS 643 3 8 AGENCOURT 
BI828691 603074707 
CB2 354 7 8 AGENCOURT 
AA669484 af74g07.r 
BU631151 UI-H-FE1- 
BU129770 603117261 
CB524885 UI-M-FY0- 
CA97 0822 AGENCOURT 
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ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
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AUTHORS 

::ttle 

JOURNAL 
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i'UBMED 
• TERENCE 
.viJTHORS 

JOURNAL 
MIDLINE 
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TITLE 

JOURNAL 
MEDLINE 
PUB MED 
REFERENCE 
AUTHORS 



AK076867 1552 bp mRNA linear HTC 07-DEC-2002 

Mus musculus adult male testis cDNA, RIKEN full-length enriched 
library, clone : 4930520C08 product : M02 5 -LIKE PROTEIN homolog [Homo 
sapiens], full insert sequence. 
'AKO 7 b" 8 67 

AK076867.1 GI: 26345723 
HTC; CAP trapper. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 

Carninci, P. and Hayashizaki , Y . 

High-efficiency full-length cDNA cloning ; 

Meth. Enzymol. 303, 19-44 (1999) 

99279253 

10349636 

Carninci , P. , Shibata,Y-, Hayatsu,N., Sugahara,Y., Shibata^K. , 
itoh,M. , Konno ( H., Okazaki, Y ; , Muramatsu, M. and Hayashizaki, Y. 
■normalization and subtraction of cap-trapper-selected cDNAsi to 
prepare full-length cDNA libraries for rapid discovery of new genes. 
Genome Res. 10 (10), 1617 -l.«0 (2000) 
204 99374 
11042159 

"> f . '- •• 

3hibata,K., Itch, M. , Aizawa , K . , Nagaoka , S ... Sasaki , N. , Carninci, P., 
Konno,H., Akiyama,J., Nishi,K., Kitsunai , T . , Tashiro,H., Itoh/M. , 
Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., Harada,A. , 
Yamamoto,R., Matsumoto , H . , Sakaguchi , S . , Ikegami,T., Kashiwagi , K . , 
Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M., 
Yorieda,Y. .. Ishikawa,T., Ozawa,K., Tanaka,T. , Matsuura,S., Kawai,J., 
Okazaki, Y., Muramatsu , M . , Inoue,Y., Kira,A. and Hayashizaki , Y . 
RIKEN integrated sequence analysis (RISA) system- - 3 84 - format 
sequencing pipeline with 384 mult icapillary sequencer 
Genome Res. 10 (11), 1757-1771 (2000) 
20530913 
11076861 

4 . • 

Xawai,J., Shinagawa, A. , Shibata,K., Yoshino,M., Itoh,M., Ishii,Y., 
Arakawa,T., Hara,A., Fukunishi , Y . , Konno,H., Adachi,J., Fukuda,S., 
Aizawa,K., Izawa,M., Nishi,K., Kiyosawa,H., Kondo,S., Yamanaka,I., 
Saito,T., Okazaki,Y., Gojobori,T., Bono,H., Kasukawa,T., Saito,R., 
Kadota,K., Matsuda,H., Ashburner , M . , BataIov,S. # Casavant,T., 
Fleischmann, W. , Gaasterland, T . , Gissi,C, King,B., Kochiwa,H., 
Xuehl,P., Lewis, S., Matsuo,Y., Nikaido,I., Pesole,G., 
Quackenbush, J. , Schriml , L . M . , Staubli,F., Suzuki, R. , Tomita, M. r - 
Wagner, L., Washio,T., 3akai,K., Okido,T., Furuno,M., Aono,H., 
Baldarelli, R. , Barsh,G., Blake, J., Boffelli # D., Bojunga,N., 
Carninci, P., de Bonaldo, M . F . , Brownstein, M . J. , Bult,C, 
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AUTHORS 

TITLE 
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FEATURES 

source 



Fletcher, C, Fujita,M., Gariboldi , M . , Gustincich, S . , Hill,D., 
Hofmann,M., Hume, D. A., Kamiya f M. f Lee, N .H. , Lyons,?., 
Marchionni,L-, Mashima,J., Mazzarelli , J - , Mombaerts , P . , Nordone,P., 
Ring B , Ringwald,M., Rodriguez,!., Sakamoto, N. , Sasaki, H. , 
"ato'K , Schonbach,C, Seya,T., Shibata,Y., Storch,K.F., Suzuki, H., 
Toyo-oka,K., Wang,K.H., Weitz,C, Whi ttaker , C . , Wilmmg,L., 
Wynshaw-Boris,A., Yoshida,K., Hasegawa,Y., Kawaji,H., Kohtsuki,S. 
and Hayashizaki, Y. 

Functional annotation of a full-length mouse cDNA collection 

Nature 409 (6821), 685-690 (2001) 

21085660 

11217851 

The FANTOM Consortium and the RIKEN Genome Exploration Research 
Group Phase I & II Team. t , 

Analysis of the mouse transcriptome based on functional annotation 
of 60,770 full-length cDNAs 
Nature 420, 563-573 (2002) 

6 (bases 1 to 1552") . 
Adachi,J., Aizawa,K., Akahira, S . , Akimura, T . , Aono,k., Aral, A., 
*rakawa,T., Bono,H., Carninci,P., Fukuda,S., Fukunishi , Y . , 
Furuno,M., Hanagaki,T., Hara,A., Hayatsu,N. , Hiramoto,K., 
:^raoka,T., Hori,F., Imotani,K., Ishii,Y., Itoh,M. , Izawa,M., 
Kasukawa,T., Kato/H., Kawai,J., Kojima,Y., Konno , H . , Kouda, M . , 
. v r . yd g Kurihara,C, Matsuyama , T . , Miyazaki,A., Nishi,K., 
Hcmura,K., Numazaki,R., Ohno ; M . , Okazaki , Y . , Okido,T., Owa,C, . 
'saito,H., Saito,R., Sakai,C.,: 3akai,K., Sano,H., Sasaki, D 
■ Sh-;bata f K., Shibata,Y., Shinagawa, A. , Shiraki , T . , Sogabe,Y., 
>■ -uzuki , H . , Tagami,M., Tagawa,A. , Takahashi , F . , Tanaka,T., 
Tojima,Y., Toya,T. , Yamamura , T . , .Yamahaka , I - , Yasunishi , A . , 
Yoshida,K., Yoshino,M., Muramatsu . M . and Hayashizaki, Y. 
Direct Submission § , 

'.Submitted (16-APR-2002) Yoshihide Hayashizaki, The Institute otj- 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
vxplorrtion Research Group, RIKEN Genomic Sciences Center (GSC) , 
^IKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail : genome - res@gsc . riken. go. jp, 
URL: http://genome.gsc. riken.go.jp/, Tel : 81-45-503 -9222 , 
v a x:81-45-503-9216) 

cDNA library was prepared and sequenced in Mouse Genome. , • 
Encyclopedia Project of Genome Exploration Research Group m Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL : ht tp : / /genome . gsc . riken . go . j p/ 
URL : http : / / f antom . gsc . riken . go . j p/ . 

Location/Qualifiers 1 • 

1. .1552 

/organism= n Mus musculus" 
/mol_type= H mRNA" 
/strain="C57BL/6J" 
/db_xref= M FANTOM^DB : 4 93C52 0C0 3 " 
/db~xref="MGI : 1894876" 
/ db_xr e f = M t axon : 1 0 0 9 0 " 
/clone= M 4930520C03" 
/sex= "male" 



/tissue_type= M testis" 

/clone__lib= " RIKEN full-length enriched mouse cDMA library" 
/dev_stage= " adult " 
CDS 316. .1320 

/note= "unnamed protein product; M025-LIKE PROTEIN homolog 
[Homo sapiens] (SWISSPROT | Q9H9S4 , evidence: FASTY , 
98.2%ID, 100%length, match-1002 ) 
putative" 
/codon start =^1 
/protein_id= » BAC3 6 513.1" 
/db_xref ="GI :26345724" 
/db_xref ="MGI : 1914081" 

/translation- "MPLFSKSHKNPAEIVKILKDNLAILEKQDKKTDKASEEVSKSLQ 
AMKEILCGTNDKEPPTEAVAQLAQELYSSGLLVTLIADLQLIDFEGKKDVTQIFNNIL 
RRQIGTRCPTVEYISSHPHILFMLLKGYEAFQIALRCGIMLRECIRHEPLAKIILFSN 
QFRDFFKYVELSTFDIASDAFATFKDLLTRHKVLVADFLEQNYDTIFEDYEKLLQSEN 
YVTKRQSLKLLGELILDRHNFTIMTKYISKPENLKLMMNLLRDKSPNIQFEAFHVFKV 
F VAS PHKTQ P I VE I LLKNQ P KL I E F L S S FQ KE RTDDE Q F ADE KN YL I KQ I RDLKKAAP 
it 

oolyA signal 1539. .1544 <'. 
/note= "putative" 

poLyA_site 1552 • 

/note= "putative" 
BASE COUNT 490 a 320 C 341 g 401 t 

ORIGIN 

Cilery Match 84.9%; Score 8.60.4; DB 11; Length 1552 ;' _ i 

riest Local Similarity 90.5^; Pred. -No. 5.8e-176; 

,.i;,i;.ch is 9 n .8; Conservative 0; Mismatches 96; Indels 0; 'Gaps., 0;. 

3 "A.TGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATC 60 

MMMMIMI IMIMM Mill I Mill HUM MMMM Ill Mill 

Dh; . 307 ATGAAA AAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAA.A.TTGTCAAAAT l 3 66 

61 ^ TGAAA^- ACi^TTTGGC C ATTTTGGAAAAGC AAG ACAAAAAGAC AGAC AAGG CTTCAGAA 12 0 

' i Ml Mill 1 1 1 1 1 1 1 i 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 It i M 1 1 1 

Db . 367 CTGAAAGACAACCTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 42 6 

0 ., -,21 GAAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA 180 

|| HIM Mill Ml II I II I II Ml I MM I IMIMM II Mill M II 

n b . 427 GAGGTGTCAAAATCTCTGCAAGCAATGAAGGAAATTCTGTGTGGAACGAACGACAAGGAG 4 86 

0v jai. ^CCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 240 

Mill III IMIMM I IMIMII II II II II II II MM I II MM III 

Db 4 37 CCCCCTACAGAAGCAGTGGCTCAGCTGGCGCAGGAGCTCTACAGCAGCGGGTTGCTGGTG 54 6 

Qt - oa\ ACACTGATAGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATA 3 00 

H MMMM 1 1 HI IMIIIIIIIIIIMIIIIIMIIIIIMIIIIIII 

Dfc , 547 ACACTCATAGCTGACCTGCAGCTCATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATA 606 

rv 3 0-1 TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGT 360 

^ il I M Mill I MM I Mill II I II II Ml I MM Mill Mill M III 

Db GOV TTCAACAACATCCTGAGAAGACAGATTGGTACACGGTGTCCTACTGTCGAGTACATCAGT 666 

0v GCT n ATC CTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 420 

|| IMIMM MUM MUM 1 1 IMIMII M II II I II II II II II 1 1 II II I 

Dfc 6(S7 TCTCATCCTCACATCCTGTT.TATGCTTCTOAAAGGCTATGAAGCCCCACAGATTGCCTTA 72 6 



Qy 421 CGTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTC 480 

I! MINI Mill III Mill 1 1 Mill III MM 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 

Db 72 7 CGCTGTGGGATTATGCTAAGAGAGTGTATTCGACATGAGCCACTTGCCAAAATCATCCTA 78 6 

Qy 4 3 3 TTTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCT 54 0- 

1 1 II II I II 1 1 1 1 1 M 1 1 i 1 1 II 1 1 M M I M Ml MM II 1 1 1 1 1 II I III 

Bb 7 87 TTTTCTAATCAGTTCAGAGATTTCTTCAAGTATGTTGAGCTGTCCACCTTTGATATCGCT 84 6 

Qy 541 TCAGATGCCTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGAC 60 0 

II Ml II II II II MIMI I MM II I MUM Mill II III MIMMIMM 

Db 84 7 TCAGATGCCTTCGCTACTTTTAAGGATTTGTTAACCAGACATAAAGTATTGGTAGCAGAC 90 6 

Qw SCV TTCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAG 660 

iMIMIIIIIMIMI 1 1 1 1 ! 1 1 1 1 1 1 1 1 I i 1 1 ! 1 1 1 ! 1 1 1 1 1 Ml! II MM 1 1 

Db 007 TTCTTAGAACAAAATTATGACACTATTTTTGAAGACTATGAGAAACTGCTGCAATCTGAG 966 

Qy 661 AATTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGQACCGTCAC 72 0. 

! I 1 1 1 1 ! 1 1 1 1 M II 1 1 1 1 1 II I Ml 1 1 M 1 1 1 I II II M II M II I ! 1 1 III 

Db ?67 AACTATGTGACAAAGAGACAATCTTTAAAGTTGCTAGGTGAGCTGATCCTGGACCGCCAC 1026 

Qy 721 AA.CTTTGCCATCATGACAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAAC 78 0 

II II MM Mill II I II II I Mill ill I MIMI III II III II MIMI I 

Db .. ,27 AATTTCACCATTATGACGAAGTATATCAGCAAGCCAGAGAACCTGAAACTGATGATGAAC 1CX86 

T : \ 1 frrcCTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAiGGTGTTT 84 0; : 1 

II MMI M IMIMMMMIMM' II MM MM Mill i I II 1 1 II I II I 

J.0 37 CrGCTTCGAGACAAAAGTCCCAACATCCAATTCGAAGCCTTCCATGTCTTTAAGGTGTTT 11-4 6 
r 941 GTGGCCAGTCCTCACAAAACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAA 900 

, Ml II Ml II Ml Mill MMI Ml I MIMI i I II II M 1 1 M I II II I II II 

.Ob ; • 4'1 GTGGCCAGCCCCCACAAAACGCAGCCTATCGTGGAGATTCTGTTAAAAAATCAGCCCAAA 12' 0.6 

Ov ^01 CTCAT^GAGTTTCTGAGCAGCTTCCAAAAAGAAAGGACGGATGATGAGCAGTTCGCTGAC 960 

I M 1 1 1 M M II I i 1 1 M II I M II IMMIIMM M II MM MM MIMI 

;Ob 12 07 CTCATTGAGTTTCTGAGCAGCTTTCAGAAAGAAAGGACAGACGACGAGCAGTXTGCTGAC 12 66 

Qy 961 GAGAAGAACTACTTGATTAAACAGATCCGAGACTTGAAGAAAACGGCCCGTTGA 1014 

MIMMIMM M IMI! Ml II I MM MIMI Mill I Mill Ml 

Db 12 67 GAGAAGAACTACCTGATTAAACAGATTCGAGACTTGAAGAAAGCAGCCCCGTGA 132 0 



RESULT- .2 • - ' 

LGCUS AK030474 2245 bp mRNA linear HTC 0.5-DEC-2002, 

DEF.IWil'ICN Mus musculus adult male pituitary gland cDNA, RIKEN full-length : 

enriched library, clone : 5330416K15 product : M02 5 -LIKE PROTEIN 

homolog [Homo sapiens] , full insert sequence. 
ACCESSION AK030474 

VERSION AK030474.1 GI: 26326468 

KEYWORDS- HTC; CAP trapper. 
SOURCE Mus musculus (house mouse) 

ORGANISM Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
REFERENCE 1 

AUTHORS Carninci,P. and Hayashizaki , Y . 



TITLE 
JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
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TITLE 
JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
REFERENCE 
AUTHORS 



High-efficiency full-length cDNA cloning 

Meth. Enzymol. 303, 19-44 (1999) 

99279253 

10349636 

2 

Carninci,?., Shibata,Y., Hayatsu,N., Sugahara # Y. , Shibata,K., 

Itoh,M. f Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki , Y . 

Normalization and subtraction of cap-trapper-selected cDNAs to 

prepare full-length cDNA libraries for rapid discovery of new genes 

Genome Res. 10 (10), 1617-1630 (2000) 

20499374 

11042159 

i 

Shibata,K. , Itoh,M., Aizawa,K., Nagaoka,S., Sasaki, N., Carninci , P . , 
Konno.H., Akiyama.J., Nishi , K. , Kitsunai , T . , Tashiro,H., Itoh,M. f 
Sumi N , Ishii,Y., Nakamura , 3 . ., Hazama,M., Ni shine, T., Harada, A. . 
Yamamoto,R., Matsumoto,H. , Sakaguchi , S . , Ikegami,T., Kashiwagi , K. , 
Fu-iwake,S., Inoue , K . , Togawa , Y . , Izawa.M., Ohara,E., Watahiki.M. , 
v 0 Aeda,Y.. l3hikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai.J., 
Okazaki,Y., Muramatsu,M. , Inoue, Y. , Kira,A. and HayashizaKi , Y . 
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aaldarelli.R., Barsh,G., Blake, J., Boffelli,D., Bo^unga.N., 
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Functional annotation of a full-length mouse cDNA collection 
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11217851 

5 
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Group Phase I & II Team. 

Analysis of the mouse transcriptome based on functional annotation 
of 60,770 full-length cDNAs 
Nature 420, 563-573 (2002) 
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Submitted ( 16 - JUL- 2 001) Yoshihide Hayashizaki, The Institute of.. 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail : genome- res@gsc . riken. go . jp, 
URL:http: / /genome . gsc . riken . go . j p/ , Tel : 81-45-5 03-9222, 
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Encyclopedia Project of Genome Exploration Research Group in Riken 
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"JRL : http : / /fan torn, gsc .riken.go.jp/ . 1 
Location/Qualifiers .•■•*!"..". 
1. .2245 • • 

/organism="Mus muscuius" 
/mol typeVmRNA" 
/strain="C57BL/6J" 
/ db_xr e f = " FANT0M_DB : 5 3 3 0 4 1 6 Kl S " 
/db_xref = M taxon: 10090" 
/clone="53 3 0416K15" 
/sex="male" 

/tissue_type= n pituitary gland" 

/clone_lib= "RIKEN full-length enriched mouse cDNA library" 
/dev_stage= " adult " 
300. .1313 

/note- "unnamed protein product; M025-LIKE PROTEIN homolog 
[Homo sapiens] (SWISSPRQT | Q9H9S4 , evidence: FASTY, 
98.2%ID, 100%-length, match=1002) 
putative" 
/codon_start=l 
/protein_id="BAC26978 .1" 
/db_xref ="GI :26326469" 

/translation^ "MKKMPLFSKSHKNPAEIVKILKDNLAILEKQDKKTDKASEEVSK 
SLQAMKEILCGTNDKEPPTEAVAQLAQELYSSGLLVTLIADLQLIDFEGKKDVTQIFN 
NILRRQIGTRCPTVEYISSHPHILFMLLKGYEAPQIALRCGIMLRECIRHEPLAKIIL 
FSNQFRDFFKYVELSTFDIASDAFATFKDLLTRHKVLVADFLEQNYDTIFEDYEKLLQ 
SENYVTKRQSLKLLGELTLDRHNFTIMTKYISKPENLKLMMNLLRDKSPNIQFEAFHV 
FKVFVASPHKTQP I VE ILLKNQPKL IEFLS S FQKERTDDEQFADEKNYL I KQIRDLKK 
AAP" 

667 a 480 c • 517 g 581 t 



Ma/i : C h 84.9%; Score 860.4; DB 11; Length 2245; 

Local Similarity 90.5%; Pred. No. 6.2e-176; 

es 913; Conservative 0; Mismatches 96; Indels 0; Gaps 0.; 
1 ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATC 6 0 

IIIIMIIIIIIIIIIIIIIMIIMMItlMIIIIMIIMIIIIIIMM Mill 

3 00 ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTCAAAATT 3 59 
ri CTGAAAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 12 0 

1 1 1 M 1 1 1 1 1 i MlllllllliMIIMIIIIillilMIIMMIIIIIIIMMM 

3 60 CTGAAAGACAACCTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA- 419 
V'vi GAAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA 18 0 

|| INN Mill I Ml Ml Ml IN I 1 1 1 i i 1 1 1 1 1 M M M Mill- M ! I 

4 0 GAGGTGTCAAAATCTCTGCAAGCAATGAAGGAAATTCTGTGTGGAACGAACGACAAGGAG 4 7 9 
1 8 ' CCCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 24 0 

~ | mm 1 1 1 1 1| | M II 1 1 1 1 1 II 1 1 M M M IMIIMMM M MM Ml 

CQ C CCCCCTACAGAAGCAGTGGCTCAGCTGGCGCAGGAGCTCTAC 53 9 

-41 ACACTGATAGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATA 3 00 

1 1 1 1 1 M II II II I M I M 1 1 1 1 1 1 1 II 1 1 M I II II M II 1 1 1 1 1 Ml 1 1 1 1 II I II 

3 a 0 ACACTCATAGCTGACCTGCAGCTCATAGACTTTGAGGGAAAAAAAGATGTGACCC AGATA 5 9-9 
■ v TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTA.CTGTGGAGTATATTAGT 3 60- 

: 1 1 in Ml || | 1 1 IMIIMMM II I! Ml Mill MM I. MM I Mill 

::;0 TTCAACAACATCCTGAGAAGACAGATTGGTACACGGTGTCCTACTGTCGAGTACATCAGT ob9, 
• '■' -rTrATCCTCATATCrTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 42 0 

"'■ IIIUIMII II III Ml I Mill MM II' MM I Mill III III II Hi 

5B0 TCTCATCCTCACATCCTGTTTATGCTTCTCAAAGGCTATGAAGCCCCACAGATXGCCTTA. 719 
■■■ ' i CGTTGTGGGATTATGCTGAGAGAATGTAT TCGACATGAACCACTTGCCAAAATGATCCTC: 4 8 0 

\\ M Ml M M II II I Mill IMMi.Mil I Ml I II 1 1 1 1 1 1 1 MM 1 1 1 1 1 1 

v-0 CGCTGTGGGATTATGCTAAGAGAGTGTATTCGACATGAGCCACTTGCCAAAATCATCCTA 779 
'Hi TTTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCT 540 

M I MM MM 1 1 II I II II il 1 1 1 HIM II III MM 1 1 IMIIMMM 

7<i0 TTTTCTAATCAGTTCAGAGATTTCTTCAAGTATGTTGAGCTGTCCACCTTTGATATCGCT 839 
S^i TCAGATGCCTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGAC 60 0 

' IMIIMMM ill Mill Ml Mill I II II II II 1 1 1 M II II 1 1 1 1 1 II I II 

V.40 TCAGATGCCTTCGCTACTTTTAAGGATTTGTTAACCAGACATAAAGTATTGGTAGCAGAC 399 



10 1 TTCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAG 660 

" II II ill II MM MM MIMIMIMMIIIMMMIMIIiMMIilMM 

900 TTCTTAGAAC 



^CAAAATTATGACACTATTTTTGAAGACTATGAGAAACTGCTGCAATCTGAG 9 5 9 



661 AATTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCAC 72 0 

II HIM II MUM 111111111 IMIMI MMMMIMIIMM III 

550 AACTATGTGACAAAGAGACAATCTTTAAAGTTGCTAGGTGAGCTGATCCTGGACCGCCAC 1019 
- '< 1 ■•XACTTTGCCATCATGACAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAAC 7 80 

|| || MM Mill Ml HIM 1 1 1 1 M 1 1 1 I II II II IMIMI Mill MM 

1 02 0 AATTTCACCATTATGACCAAGTATATCAGCAAGCCAGAGAACCTGAAACTGATGATGAAC 1079 
7 81 CTCCTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTT 34 0 



Db 

Qy 

Db 

Qy 

Db 

Qy. 

Db 



I I INN II MINI I MINIUM 1 1 MINIM Mill 1 1 1 1 1 1 1 1 1 1 1 1 

j 0 3 0 CTGCTTCGAGACAAAAGTCCCAACATCCAATTCGAAGCCTTCCATGTCTTTAAGGTGTTT 113 9 
841 GTGGCCAGTCCTCACAAAACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAA 900 

. 1 1 1 1| I II II 1 1 N II 1 1 I II 1 1 II I II II I II I II M I M ! 1 1 1 1 1 N 

114.0 GTGGCCAGCCCCCACAAAACGCAGCCTATCGTGGAGATTCTGTTAAAAAATCAGCCCAAA 1199 
oQi CTCATTGAGTTTCTGAGCAGCTTCCAAAAAGAAAGGACGGATGATGAGCAGTTCGCTGAC 960 

^ \\\\\\\\\\\\\\\\\\\\\\\ N INN Mill I II II M WWW 

12 0 0 CTCATTGAGTTTCTGAGCAGCTTTCAGAAAGAAAGGACAGACGACGAGCAGTTTGCTGAC 12 5 9 
GAGAAGAACTACTTGATTAAACAGATCCGAGACTTGAAGAAAACGGCCCCTTGA 1014 

I I I i 1 1 II I II I II I II 1 1 II II 1 1 II I IN Mil I Nil I II I II III 

12 /JO GAGAAGAACTACCTGATTAAACAGATTCGAGACTTGAAGAAAGCAGCCCCGTGA 1313 
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,^053642 3039 bp . mRNA , linear HTC 05-DEC-2002 

?:iU3 musculus 0 day neonate eyeball cDNA , RIKEN full-length enriched' 
library, clone : E130116O21 product : M02 5 -LIKE PROTEIN homolog [Homo 
Sapiens], full insert sequence. 
AK053642 

AK053642.1 GI:26343600 

HTC; CAP trapper. . . .' 

■lus musculus (house mouse) 

:ius musculus ; ■ 
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•i i . 
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High-efficiency full-length cDNA cloning r ' • 

Meth. Enzymol. 303, 1 9-44 (1999) : . 
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Genome Res 
20499374 
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URL : http : / / fan torn . gsc . riken . go . j p/ . 
FEATURES Location/Qualifiers 
source 1. .3039 

/organism^ "Mus musculus" 

/mol_type="mRNA" 

/strain="C57BL/6J" 

/ db_xr e f - " FANTOM_DB :E130116O21 M 

/ db_xr e f = " t axon : 1 0 0 9 0 " 

/clone="E130116O21" 

/ 1 i s sue_type = " eyeba 1 1 " 

/clone__lib= "RIKEN full-length enriched mouse cDNA library" 
/dev_stage="0 day neonate" 

CDS 287. .1300 

/note- "unnamed protein product; M025-LIKE PROTEIN homolog 

[Homo sapiens] (SWISSPROT | Q9H9S4 , evidence: FASTY, 

98.2%ID, 100%length, match=1002) 

putative" 

/codon_start=l 

/protein_id= " BAG 3 5457.1" 

/db xref="GI: 26343601" 

/'translation^ "MKKMPLFSKSHKNPAEIVKILKDNIjAILEKQDKKTDKASEEVSK 
SLQAMKEILCGTNDKEPPTEAVAQLAQELYSSGLLVTLIADLQLIDFEGKKDVT.QIFN 
NILRRQIGTRCPTVEYISSHPHILFMLLKGYEAPQIALRCGIMLRECIRHEPLAKIIL 
FSNQFRDFFKYVELSTFDIASDAFATFKDLLTRHKVLVADFLEQNYDTIFEDYEKLLQ 
SEN YVTKRQSLKLLGEL I LDRHNFT IMTKY I SKPENLKLMMNLLRDKS PNI QFEAFHV 
FKVFVASPHKTOPIVEILiiKNQPKLIEFLSSFQKERTDDEOFADEKNYLIKQIRDLKK 

AAP" 

■iolyA .. ro.jnal 3022. .3027 - . - : 

/note- 'putative" 

Voly^ :>i^e 303 9 '' 

/noce= "putative" 

.BASE GOr^rr 882 a 645 c ^ 7 6 q 336 t . ! 

ORIGIN 

Qu-rv Match 54.9%; Score 860.4; DB 11; Length 3039; ;■ ■ • " 

Bost^ Local Similarity 90.5%; Pred. No. 6.6e-176; 

fetches 918; Conservative 0; Mismatches 96 ; Indels 0; Gaps 0; 
n.. ' i ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATC 60 

MMMMMMMMMMMMMMMMMMMMiMMIMIMM Mill 

. Db 2 3-7 ATGAAAAAAATC^^ 346- 

r , PTGAAAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 12 0 

ilMIMMII MMMMMMMiMMMIMMMMMMMMMMMMI 

Bb 347 - CTGAAAGACAACCTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 40* 

nv 1'>1 GAAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA 18 0 

" II HIM Mill 1 1 II I Mill MM MUM I Mill M II Mill II II 

Db 407 GAGGTGTCAAAATCTCTGCAAGCAATGAAGGAAATTCTGTGTGGAACGAACGACAAGGAG 466- 

^ , fl CCCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 24 0? 

'"' MMI Mill II III MM MUM- II II II I Mill M III I! MM Ml 

Db 4 6 7 ccccCTACAGAAGCAGTGGCTCAGCTGGCGCAGGAGCTCTACAGCAGCGGGTTGCTGGTG 5 2 6 

nv -'I ACACTGATAGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATA 3 00, 

y MMI Ml MM II I MM Ml I II II II II 1 1 M II M M M I II 1 1 II II II II 1 1 

Db 527 ACACTCATAGCTGACCTGCAGCTCATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATA 58.6 



Qy 



\J.O 



0\ 



0y 301 TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAG^ 360 

587 Uc^CAACA^ 646 
ncTCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 42 0 : 
Qy ViCTCATCC TCATATCCTu | | | | | | | | | I I I I I I I I I I 

647 TCTCATCCTCACATCCTGTTTATGCT^ 706 
Qy 42-1 CGTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCAC^ 480 

Y M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 

Db 707 CGCTGTGGGATTATGCTAAGAGAGTGT 766 

4»1 TTTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGAT 540: 
M I I I I M II I I i I 1 I I i I I I I I I I M Ml I I I I I I I I I I I I I II I 

Db 7,7 UUctIaIcAG^ 826 ' 

541 TCAGATGCCTTTGCTACTTTCAAGGATTTACTAACCAGACATAA AGTGTTGGTAGCAGAC 6 00 
,41 fCACATCCC 11111111' I I I I I I I I I I I I I M I 111 III III Ml 

601 TTCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAG 660 
f 601.TTCTTAGAA | | | | | | | | | U I M II I I M I II I II I MM II MM II - 

., R8? ; tcttag/J^caaIattai'G^ 946 ; 

: ''?.TTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATGCTC 72.0 • 

,7 1 1 iii is !| ii MM I MM MM I MM 1 1 MIMiMMMMMI Ml 

;x?\CTATGTGACAAAGAGACAATCTT^ 10Qi >' 
■ AACTTTGCCATCATGACAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATC 7 80., 

I* ! i ill! MMI MIMIMMMMMI I II i M 1 1 1 1 II II II II Ml M 
,, .. 0 : v/ ^ T ttcacc1ttatgaccaagtatatcagcaagccagag W«6 

'71,1 CTCCTTCGGGATAAAAGTCGCAACATCCAGTTTGAAGCCT^ 84* • 

il ii M | || MIMIMMMMMI 1 1 ! 1 1 1 MMI MM I II MM i 

i0 67 CTGCTTCGAGACAAAAGTCCCAACATCC * 12 * 
C.,',1 GTGGCCAGTCCTCACAAAACACAGCC'rATTGTGGAGATCCTCT 90:0. 

1 1 ! 1 1 1 1 1 || || M I II I II 1 1 1 II II M i I II I II M II I II I II II 

Db .:127 GTGGCCAGCGCCCAciJvAAC^ 1186 

,01 CTCATTGAGTTTCTGAGCAGCTTCCAAAAAGAAAGGACGGATGATGAGCAGTTC<3CTGAC 960 

■ Mill nil III HIM MM II II Mill M 1 1 M MM M Mill I 

D> il.S7 CTCATTGAGTTTGTGAGC^ 124 ^' 

<,,-, GAGAAGAACTACTTGATTAAACAGATCCGAGACTTGAAGAAAACGGCCCCTTGA 1014 

> : " HIM MMI II II Ml III Mill IMM II III Mill I Mill III 

Db 1247 GAcJIaGAACT^CCTG 1300 

WLiuhT 4 

™' 768 /K076756 1377 bp tnRNA linear HTC 07-DEC-2002 

DEFINITION' Mus musculus adult male testis cDNA, RIKEN full-length enriched^ 
DEFINITION «. brary< c ione:4930433N18 product : M02 5 -LIKE PROTEIN homolog [Homo ; 

sapiensl , full insert sequence. 
ACCESSION AK076758 
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SOURCE 
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YCRNAL 
il.DLINE 
-j'BMED 
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JOURNAL: 
MEDLINE 
PUBMED 



AK076758 .1 GI : 26345637 
HTC; CAP trapper. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Butheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

Carninci,P. and Hayashizaki , Y . 

High-efficiency full-length cDNA cloning 

Meth. Enzymol. 303, 19-44 (1999) 

99279253 

10349636 

2 

Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K., 

Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki , Y . 

Normalization and subtraction of cap- trapper-selected cDNAs to 

prepare full-length cDNA libraries for rapid discovery of new gene 

Genome Res. 10 (10), 1617-1630 (2000) 

20499374 

11042159 

3 . „ 

Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki, N., Carnmci,P. 
Xonno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., ltoh,M. , 
<:umi,N., Ishii,Y., Nakamura, S. , Hazama,M (/ Nishine,T. , Harada,A. , 
Yamanoto,R., Mat sumo to, H . ; Sakaguchi , S . , Ikegami,T., Kashiwagi , K. , 
vuiiwake,S., Inoue,K., Togawa , Y . , Izawa , M . , Ohara, E . , Watahiki , M . , 
Ycneda,Y., Ishikawa,T., Ozawa, K. , Tanaka, T. , Matsuura,S., Kawai,J. 
Okazaki,Y., Muramatsu , M . , Inoue,Y., Kira.A. and Hayashizaki , Y . 
RIKHN integrated sequence analysis (RISA) system- -384 -format. ■ 
sequencing" pipeline with 384. mult i capillary sequencer 
Genome Res. 10 (11), 1757-1771 (2000) 
20530913 
'.1076861 

■a . i * ■ C r • ' 

^awai,J., 3hinagawa,A. , Shibata,K., Yoshino,M. , Itoh,M., lshii v Y., 
Arakawa,T. , Kara, A., Fukunishi , Y . , Konno,H., Adachi , J . , Fukuda,S., 
Aizawa,K., Izawa,M.-, Nishi,K. , Kiyosawa/H., Kondo,S., Yamanaka,I., 
Saito,T., Okazaki,Y., Gojobori,T., Bono,H., Kasukawa,T., Saito,R., 
Kadota,K., Matsuda,H., Ashburner , M . , Batalov,S., Casavant,T., 
:neischmann,W. ; Gaasterland, T . , Gissi,C, King , B . , Kochiwa , H . , 
Kuehl,P., Lewis, S, , Matsuo, Y . „ Nikaido , I . , Pesole,G., 
Ouackenbush, J. , Schriml , L . M . , Staubli , F ., Suzuki , R . , Tomita,M., 
■Wagner f L. f Washio,T. f Sakai,K., Okido,T., Furuno,M. , Aono,H. , 
Baldarelli,*., Barsh,G., Blake, J., BoffeIli,D., Bojunga,N., 
Carninci,P., de Bonaldo , M . F . , Brownstein, M . J . , Bult,C, 
Fletcher, C, Fujita,M., Gariboldi , M . , Gustincich, S . , Hili,D. f 
Hofmann,M., Hume, D. A., Kamiya,M., Lee , N . H . , Lyons , P . , 
Marchionni,L. , Mashima,J., Mazzarelli , J . , Mombaerts , P . , Nordone,P 
Ring,B., Ringwald,M., Rodriguez , I . , Sakamoto, N. , Sasaki, H., 
Sato,K., Schonbach,C. , Seya,T., Shibata,Y., Storch,K.F., Suzuki , H 
Toyo-oka,K., Wang,K.H., Weitz,C, Whittaker , C . , Wilmmg,L., 
Wynshaw-Boris # A. , Yoshida,K., Hasegawa,Y., Kawaji,H., Kohtsuki,S. 
and Hayashizaki , Y. 

Functional annotation of a full-length mouse cDNA collection 

Nature 409 (6821), 635-690 (2001) 

21085660 

11217851 
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, Akimura,T., Aono,H., Arai,A., 
, Fukuda , S . , Fukunishi , Y . , 
Hayatsu,N. , Hiramoto , K . , 
Ishii,Y., Itoh,M., Izawa,M. 



TITLE 
JOURNAL 



COMMENT 



TEATURES 

Source 



CDS 



The FANTOM Consortium and the RIKEN Genome Exploration Research 

Salyslfof the mouse^rans crip tome based on functional annotation 
of 60, 770 full-length cDNAs 
Mature 420, 563-573 (2002) 
6 (bases 1 to 1377) 
Adachi,J-, Aizawa,K., Akahira,S. 
Arakawa,T., Bono,H., Carninci,P. 
Furuno , M . , Hanagaki , T . , Hara , A . , 
iri r^nka T . Hori,F., Imotani , K . , _ 

SsuS^x'., Kato,H., Kawai.J., Ko j ima, Y Konno, H Kouda, M . , 
Koya,S., Kurihara.C, Matsuyama,T. , Miyazaki,A. , Nishi,K. , 
Nomura,K., Numazaki,R., Ohno,M., Okazaki,Y., Okido.T Owa, C . . 
SaitoH., Saito,R., Sakai.C, Sakai.K,, Sano,H., Sasaki, D 
Shibata.K.. Shibata,Y. ( Shinagawa, A. , Shiraki,T., Sogabe,Y., 
Suzuki, H., Tagami,M., Tagawa,A., Takahashi , F . , Tanaka T 
Tejima,Y., Toya,T., Yamamura , T . , Yamanaka, I . , Yasunishi , A. , 
Yoshida,K'. , Yoshino,M., Huramatsu,M. and Hayashizaki , Y . 

3ubmitted bl (i6-A?R-2002) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN) , Laboratory for enom 
exploration Research Group, RIKEN Genomic Sciences center (GSC) 
■IKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
ranaqawa 230-0045, Japan (E-mail :genome-res@gsc . rxken. go . jp, 
: http r / /genome . gsc . riken .go.jp/, Tel :. 81 -4 5 -503 -9222 , 
^X:81-45-503-9216) ' .. 
-HA library was prepared and sequenced in Mouse Genome 
C yclopedia Project of Genome Exploration Research Group in Riken 
Gnomic Sciences Center .and Genome Science Laboratory in R KM. 
Division of Experimental Animal Research in Riken contributed to 
or' pare mouse tissues. 

Please visit our web site for further details. 
tJRli: http: //genome. gsc. riken.go.jp/ 
URL : ht tp : / / f antom .gsc. riken . go . j p/ . 
Location/Qualifiers 
1. .1377 

/organism="Mus musculus" 
/mol_type="mRNA" 
/strain="C57BL/6J" 

/ db_xr e f = " FANTOM_DB : 4 9 3 0 4 3 3N1 8 " 
/ db_xr e f = "MGI : 18 94 857 " 
/ dbjxr e f =* " t axon : 1 0 0 9 0 " 
. /clones " 4 93 04 3 3N18 " 
/sex="male" 
/tissue typ^- M testis n 

/clone_lib="RIKEN full-length enriched mouse cDNA library 
/dev_stage= "adult" 

% 0 te=»unnamed protein product; M025-LIKE PROTEIN homolog 
[Homo sapiens] (SWISSPROT | Q9H9S4 , evidence: FASTY, 
98.2%ID, 100%length, match=1002) 
putative" 
/codon_start=l 
/prot e in_id= " BAC3 64 7 0.1" 
/db_xref="GI : 26345633 " 
/db_xref="MGI: 1914081" 



Cv 
Lb 

Qy 
Db 



/translations "MKKMPLFSKSHKNPAEIVKILKDNLAILEKQDKKTDKASEEVSK 
SLQAMKEII.CGTNDKEPPTEAVAQLAQELYSSGLLVTLIADLQLIDFEGKKDVTQIFN 
NILRRQIGTRCPTVEYISSHPHILFMLLKGYEAPQIALRCGIMLRECIRHEPLAKIIL 
FSNQFRDFFKYVELSTFDIASDAFATFKDLLTRHKVLVADFLEQNYDTIFEDYEKLLQ 
SENYVTKRQSLKLLGELILDRHNFTIMTKYISKPENLKLMMNLLRDKSPNIQFEAFHV 
FKVFVASPHKTQPIVEILLKNQPKLIEFLSSFQKERTDDEQFADEKNYLIKQIRDLKK 

AAP" 

3ASB COUNT 430a 294c 306 g 347 t 

ORIGIN 

Query Match 84.7%; Score 858. 8; DB 11; Length 1377; 

ierit Local Similarity 90.4%; Pred. No. 1.3e-175; 

Matches 517; Conservative 0; Mismatches 97; Indels 0; Gaps 0; 

i ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATC 60 

' I ! 1 1 1 M 1 1 1 1 1 1 1 ! M 1 1 1 1 1 1 i 1 1 1 1 1 1 ! 1 1 1 1 ! 1 i I M 1 1 1 1 1 1 1 1 ] 1 1 1 Mill 

2-37 ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTCAAAATT 34 6 
<~1 (-TGAAAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 12 0 

I llitlllll IIIIIIIMII 1 1 1 1 1 1 1 1 1 1 ! I ! 1 1 1 1 1 ! i 1 1 1 1 1 1 1 1 1 " 

347 CTGAAAGACAACCTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 4 06 
■>-»V ^AAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAACAA 18 0 : 

* Tl mil Mill Ml Mi Ml IMMMMMMI 1 1 Mill II II 

•„ 7 qagGTGTCAAAAT.CTCTGCAAGCAATGAAGGAAATTGTGTGTGGAACGAACGAGAAGGAG 4 6,6. 

• - rcCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 24* : 

' MM! 1 1 ! t i 1 1 1 . 1 1 1 ! I i i 1 1 ! | s| MM II II II 1 1 II I M MM III 

\-rri CCCCCTACAGAAGCAGTGGCTCAGCTGGCGCAGGAGCTCTACAGCAGCGGGTTGCTGGTG 52b.' 
-A i ACACTGATAGCTGACC rGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGAC.CCA.GATA 300 . 

■ ! M M i i 1 1 Ml I! M I II M M I M M M M M M I M II 1 1 1 1 M II 1 1 M 

y.;yi ACACTCATAGCTGACCTGCAGCTCATAGACTTTGAGGGAAAAAAAGATGTGACeCAGATA d86> 
tTTAACAACATCTTGAGAAGACAGATAGGCACTCGG^GTCCTACTGTGGAGTATATTAGT 36S) 

" M i il Ml I i I II I i I M Mi III - I I II III MM Mi Ml I MM II III 

6 3 1 ttcaACAACATCCTGAGAAGACAGATTGGTACACGGTGTCCTACTGTCGAGTACATCAGT 64 6 

• -l ."•(-"TCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 420 

llitlllll I I II Ml Ml Mill Mill III II MUM II MM Mlllllill ■ 

547- tCTCATCCTCACATCCTGTTTATGCTTCTCAAAGGCTATGAAGCCCCACAGATTGCCTTA 7 06 
"I gGTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTC 4 80 

M II II 1 1 MM II 1 1 Mill MMMMMMM I II MUM Mill I Mill 

707 CGCTGTGGGATTATGCTAAGAGAGTGTATTCGACATGAGCCACTTGCCAAAATCATCCTA 7 66 
^ :, TTTTCTAATCAATTC AGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCT 540- 

' mil II MM I III Mill Mill Mill II Ml MM II I II Mill III 

TTTTCTAATCAGTTCAGAGATTTCTTCAAGTATGTTGAGCTGTCCACCTTTGATATCGCT 8 2 6 
- I T r AGATGCCTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGAC 6 00 

II Ml IM Ml MM III I Ml Mill I II 1 1 M 1 1 1 II II II II I M 

8 2 7 TCAGATGCCTTCGCTACTTTTAAGGATTTGTTAACCAGACATAAAGTATTGGTAGCAGAC 886 
da'- TTCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAG 660 

MMMIMMIIMM II II 1 1 II I II 1 1 II 1 1 II I II II II I MM II I MM I 

887 TTCTTAGAACAAAATTATGACACTATTTTTGAAGACTATGAGAAACTGCTGCAATCTGAG 946 



:,b 



Db 767 

Qy 

Db 
Qy 
Db 



Qy 


661 


Db 


947 


Gy 


72 1 


Db 


1007'* 


Qy 


781 


Db 


1067 


Qy 


341 


nb 


1127 


Qy 


901 


Db 


1.187 


Qy 


961 




12 4 7 



AATTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCAC 7 2 0 

II IMM II MINIM 1 1 1 ! I It 1 1 MMMI Mill III 

AACTATGTGACAAAGAGACAATCTTTAAAGTTGCTAGGTGAGCTGATCCTGGACCGCCAC 100 6 



MM MM Mill 1 1 1 1 1 II M M M I M I MMMI Mill 



CTCCTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTT 84 0 

II Mill M Mill II Mill I Ml I M Ml Mill IMM I II IMM I Ml 

CTGCTTCGAGACAAAAGTCCCAACATCCAATTCGAAGCCTTCCATGTCTTTAAGGTGTTT 112 6 



MM MM M I MUM I MM MM II I Mill II II It I M 1 1 1 II M M II I 

GTGGCCAGCCCCCACAAAACGCAGCCTATCGTGGAGATTCTGTTAAAAAATCAGCCCAAA 
CTCATTGAGTTTCTGAGCAGCTTCCAAAAAGAAAGGACGGATGATGAGCAGTTCGCTGAC 

I Ml 1 1 MUM MMMI Ml M MMMI MM II M Mill III II MM 

CTCATTGAGTTTTTGAGCAGCTTTCAGAAAGAAAGGACAGACGACGAGCAGTTTGCTGAC 
GAGAAGAACTACTTGATTAAACAGATCCGAGACTTGAAGAAAACGGCCCCTTGA 1014 

MUM MUM Ml MM II Ml I 1 1 II I II III M II I I Mill III 



?.K0 13 2 05 

locus 

definition 
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A^_013205 1449 bp mRNA linear HTC 05-DEC-20O2' 

Mus musculus 10, 11 days embryo whole body cDNA, RIKEN full-length 
enriched library, clone : 2 81043 0N08 product : M025 -LIKE PROTEIN 
homolog [Homo sapiens] , full insert, sequence. 
AK013205 

AK013205.1 GI:12850419 . 

HTC; CAP trapper. 

Mus musculus (house mouse) 

Mus musculus 

Eukaryota; Meuazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae ; Mus. ■ 
1 

Carninci,P. and Hayashizaki , Y . 

High-efficiency full-length cDNA cloning 

Meth. Enzymol. 303, 19-44 (1999) 

99279253 

10349636 

2 

Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K.., 
Itoh,M., KonncH., Okazaki,Y., Muramatsu,M. and Hayashizaki , Y . 
Normalization and subtraction of cap-trapper-selected cDNAs to 
prepare full-length cDNA libraries for rapid discovery of new genes 
Genome Res. 10 (10), 1617-1630 (2000) 

20499374 ' • 

11042I5S 
3 

Shibata, K. , Itoh,M., Aizawa, K. 
Konno,H., Akiyama,J., Nishi,K. 
Sumi,N., Ishii,Y., Nakamura,S. 



Nagaoka , S . , Sasaki , N . , Carninci , P . 
Kitsunai,T. , Tashiro,H. , Itoh,M. , 
Hazama,M., Nishine,T., Harada,A., 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 



JOURNAL. 
MEDLINE 
I'UBMED 
REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
REFERENCE 
AUTHORS 



TITLI-j 
JOURNAL 



Yamamoto,R., Matsumoto,H. , Sakaguchi , S . , Ikegami,T., Kashiwagi , K. , 
Fujiwake.S., Inoue.K., Togawa,Y., Izawa.M.., Ohara,E., Watahiki,M., 
Ycneda.Y., Ishikawa,T., Ozawa,K., Tanaka , T . , Matsuura,S., Kawai,J., 
Okazaki,Y., Muramatsu, M . , Inoue,Y. ( Kira,A. and Hayashizaki , Y . 
RIKEN integrated sequence analysis (RISA) system- -384- tormat 
sequencing pipeline with 384 multicapillary sequencer 
Genome Res. 10 (11), 1757-1771 (2000) 
20530913 
11076861 

Kawai,J., Shinagawa,A. ( Shibata,K., Yoshino,M., Itoh,M. , If 1 * 1 '*-' 
Arakawa,T.. Hara,A., Fukunishi , Y . , Konno,H., Adachi,J., Fukuda,S., 
Aizawa,K., Izawa,M., Nishi.K.. Kiyosawa,H., Kondo,S., Yamanaka , I . , 
Saito T , Okazaki,Y., Gojobori,T., Bono,H., Kasukawa,T., Saito, R., 
Kadota K , Matsuda.H., Ashburner,M. , Batalov,S., Casavant,T., 
Fleischmann,W., Gaasterland, T. , Gissi.C, King.B., Kochxwa,H., 
Xuehl.P., Lewis, S., Matsuo,Y. , Nikaido, I. . Pesole,G., 
Quackenbush,J., Schriml , L .M . , Staubli , F . , Suzuki, R., Tomita.M., 
Wagner, L . , Washio,T., Sakai,K., Okido,T., Furuno ( M., Aono,H.. 
3aldarelli,R., Barsh,G., Blake, J., Boffelli.D., Bojunga,N. , 
Carninci P., de Bonaldo, M . F . , Brownstein, M . J . , Bult,C, 
Fletcher, C, Fujita.M., Garibdldi , M . , Gustincich, S . , Hxll,D., 
Hofmann,M., Hume , D . A . , Kamiya,M., Lee , N . H . , Lyons, P., 
Marchionni,L., Mashima, J.., Mazzarelli , J . , Mombaerts , P . , Nordone, P., 
*:ing,B., Ringwald, M. , Rodriguez , I . , Sakamoto, N . , Sasaki, H., 
Ca:-o,K., Schonbach,C, Seya,T., Shibata,Y., Storch,K.F., Suzuki, H. , 
"oyo-oka,K.. Wang,K.H., Weitz.C, Whittaker , C . , Wilmmg,L., 
Wynshaw-Boris,A., Yosh:da,K., Hasegawa, Y . , Kawaj i , H. , Kohtsuki,S. 
and Hayashizaki, Y. 

functional annotation of a- full-length mouse cDNA collection 
Nature 4 09 (6821) , 685-690 (2001) ' . 

21085660 
11217851 

The FANTOM Consortium and thti RIKEN Genome Exploration Research 
Group Phase I & II Team. . . 
Analysis of the mouse transcriptome based on functional annotation 

of 60,770 full-length cDNAs 
Nature 420, 563-573 (2002) 
6 (bases 1 to 1449) 
Adachi,J., Aizawa,K., Akahira,S. 
Arakawa , T . , Bono , H . , Carninci , P . 
Furuno , K . , Hanagaki , T . , Hara , A. , 
Hiraoka,T., Hori.F., Imotani, K. , 

Kasukawa,T., Kato,H., Kawai,J., Kojima,Y., Konno,H., Kouda,M., 
Koya.S., Kurihara,C, Matsuyama, T . , Miyazaki,A., Nishi,K., 
Nomura, K. , Numazaki,R., Ohno,M., Okazaki,Y., Okido, T . , Owa , C . , 
Saito, H., Saito, R., Sakai,C, Sakai,K., Sano,H., Sasaki, D 
Shibata,K., Shibata,Y., Shinagawa, A. , Shiraki,T., Sogabe,Y., 
Suzuki, H., Tagami,M., Tagawa,A., Takahashi , F . , Tanaka, T., 
Tejima,Y., Toya,T. ( Yamamura.T., Yasunishi , A. , Yoshida,K., 
Yoshino,M., Muramatsu, M. and Hayashizaki, Y. 

Direct Submission . , 

Submitted (10-JUL-2000) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN), Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 



, Akimura,T., Arai,A., Aono,H. 

, Fukuda,S., Fukunishi, Y. , 
Hayatsu,N., Hiramoto,K., 
Ishii,Y., Itoh,M., Izawa,M. 



Kanagawa 230-0045, Japan (E-mail :genome-res@gsc . riken .go .jp, 
URL: http://genome.gsc.riken.go. jp/, Tel : 81-45-503-9222 , 
Fax:81-45-503-9216) 

COMMENT Please visit our web site (http://genome.gsc.riken.goop/) for 

further details. 
. . cDNA library was prepared and sequenced in Mouse Genome 

Encyclopedia Project of Genome Exploration Research Group m Riken 
(Gnomic Sciences Center and Genome Science Laboratory in RIKEN . 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. First strand cDNA was primed with a primer 
*["5 ■ GAGAGAGAGAAGGATCCAAGAGCTCTTTTTTTTTTTTTTTTVN 3 • ] , cDNA was 
prepared by using trehalose thermo-activated reverse transcriptase • 
arid subsequently enriched for full-length by cap- trapper. cDNA went 
through one round of normalization to Rot = 7.5 and subtraction to 
Rot = 37.5. Second strand cDNA was prepared with the primer adapter 

of sequence [5' , 
GAGAGAGAGATTCTCGAGTTAATTAAATTAATCCCCCCCCCCCCC 3 ■ ] . cDNA was cleaved 

with Xhol and Sstl. Cloning sites, 5' end: Xhol; 3' end: Sstl. 
Host : SOLR. 
FEATURES Location/Qualifiers 
"-"" source "1- -1449 

/organism="Mus musculus" ■ 

/mol_type="mRNA" 

/strain="C57BL/6J" 

/db xref="FANTOM_DB: 2 81043 ONO 8 " 

/db xrefV-"MGI: 1893512" 

/ db__:<re f - " t axon": 10 0 90" 

/cioue= ,, 2310430N08' 1 ' .- 

■/i-insue__type= "whole body" 

/clone_lib== "RIKEN full-length .enriched mouse cDNA library" 
/dev 3tage="lC, 11 days embryo" 
; •• : ■ 1 • 4 .:iire 281. ".1292 ; r.^c. -az' 

/note="M025-LIKE PRGTEIN homolog [Homo sapiens] 
(SWISSPROT|Q9H9S4, evidence: FASTY . 98 . 2%XD, 100%le:ngth, 
mat ch= 1002) 
putative" 

/db_xref="MGI : 1914081" 
453 a 304 c 325 g 367 t : " v - ■ ■ 

QU3-Y Match 83.3%; Score 844 . 8 ; DB 11; Length 1449; - 

■.Meat Local Similarity 90.2%; Pred. No. 1.4e-172; ■ 
•batches':- 915;' Conservative 0; Mismatches 97; Indels 2; Gaps, 1; 

1 AXGAAAAAAA TGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATC 60 

IIIIMIIMiMi MMMMMMMMMIMMMIIMMMMMI Mill 

L , . 2 8.1 ATGAAAAAAATGCC^ 34 0 

Ov 61 CTGAAAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 120 

II llllll Ml Ml II I MUM I Ml II I Mill MUM Ml II MUM II MM 

341 CTGAAAGACAACCTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 4 00 
191 GAAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA ;80 

II || III' Mill II MUM MM II 1 1 II 1 1 M II II II II Mill II If 

4 0 1 GAGGTGTCAAAATCTCTGCAAGCAATGAAGGAAATTCTGTGTGGAACGAACGACAAGGAG 4 6 0 
181 CCCCCAACAG AAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 2 4 0 



Db 

Cy 
Db 

Qy 



Mill INN M Mill MINN I li II M I 111 II MM I II MM Ml 

Db 461 CCCCCTACAGAAGCAGTGGCTCAGCTGGCGCAGGAGCTCTACAGCAGCGGGTTGCTGGTG 52 0 

Qy 241 ACACTGATAGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATA 3 00 

Mill II Mill Mill II ill 1 1 MINI Mill MMI1 1 1 Mill II MM III 

Db 521 ACACTCATAGCTGACCTGCAGCTCATAGACTTTGAGGGAAAAAGAGATGTGACCCAGATA 580 

Qy 301 TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGT 3 60 

II Mill MM Mill II Mill I II II Mi 1 1 MUM II Mill II III 

Db 581 TTCAACAACATCCTGAGAAGACAGATTGGTACACGGTGTCCTACTGTCGAGTACATCAGT 6 40 

Qv 3 61 GCTCATCCTCATATCCTGTTTATGCTCGTCAAAGGATATGAAGCCCCACAGATTGCCTTA 420 

Mill II Ml IMIMMIMIM MM I II I I II MM MUM Ml 1 1 Mill I 

Db £41 TCTCATCCTCACATCCTGTTTATGCTTCTCAAAGGCTATGAAGCCGCACAGATTGCCTTA 7 00 

Qv :*21 CGTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTC 4 80 

II IMIMMIMIM Mill Mill Ml III ML II II II 1 1 II I M II 1 1 1 II 

Db 7 01 CGCTGTGGGATTATGCTAAGAGAGTGTATTCGACATGAGCCACTTGCGAAAATGATCCTA 7 60 

Qy 5 -31 TTTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCT 540 

MUM Mill M Mill III MM MM 1 1 1 1 1 I II I M II 1 1 1 1 II 1 1 1 

Db 761 TTTTCTAATCAGTTCAGAGATTTCTTCAAGT - - GTTGAGCTGTCCACCTTTGATATCGCT 818 

Qv ; 1 TCAGATGCGTtTGCTACTTTCAAGGATTTACTAACCAGACAT AAAGTGTTGGTAGCAGAC 600, 

. Mill Ml III MINIM Mill III 1 1 M I M MINIMI 1 1 1 1 1 1 ! 1 1 1 1 1 

Db . TCAGATGCCTTCGCTACTTTTAAGGATTTGTTAACCAGACATAAAGTATTGGTAGCAGAC 8 7 ! 8" 

Qv .1 'rrCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAOTCTGAG 660C 

. ! I II II 1 1 1 1 II 1 1 1 ! I 1 1 MM 1 1 1 i 1 1 II II 1 1 i II I II 1 1 1 INI II MINI 

. N • /? TTCTTAGAAC^AAATTATGACACTATTTTTGAAGACTATGAGAAACTGCTGCAATCTGAG .93®" 
O Cb 1 ■ Ai\T.TATGTTACTAAGA.GACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACt'GTCAC 7 20 

! I Mill M 1 1 1 1 1 1 II II 1 1 1 II M MM Ml IN! Mill Ml I Nil Ml 

" 93 5 AACTATGTGACAAAGAGACA^TCTTTAAAGTTGCTAGGTGAGCTGATCCTGGAGCGCCAC 9 93 
0? 721 AACTTTGCCATCATGACAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAAC 780 

II II INI INN M Ml I M Ml 1 1 1 M I II II II I II III 1 1 INN Nil 

Dl' AATTTCACGATTATGACGAAGTATATCAGCAAGCCAGAGAACCTGAAACTGATGATGAAC 105 8 

Qy '>8I CTCCTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTT 84 0 

II II III II MMIM MINI I Ml II' Mill I II IJM I III II MM Ml 

Db , 1 0 5 9 CTGCTTCGAGAC AAA AGTCCCAAC ATCCAATTCGAAGCCTTCCATGTCTTTAAGGTGTTT 1 1 18 

Qv SA1 GTGGCCAGTCCTCACAAAACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAA 900' 

I I III II I II II INI N INN II I II I MM I MMIM MMIM MMM I 

Db 1119 GTGGCCAGCGCCCACAAAACGCAGCCTATCGTGGAGATTCTGTTAAAAAATCAGCCCAAA 117 8 

Qy 901 CTCATTGAGTTTCTGAGCAGCTTCCAAAAAGAAAGGACGGATGATGAGCAGTTCGCTGAC 960 

MM! MM MM MMM MM II MM MM II N III I INI INN I 

Db 1179 CTCAXTGAGTTTCTGAGCAGCTTTCAGAAAGAAAGGACAGACGACGAGCAGTTTGCTGAC 123 8 

Qy 561 GAGAAGAACTACTTGATTAAACAGATCCGAGACTTGAAGAAAA'CGGCCCCTTGA 1014 

II I III MINI NNINIIMM NNI INN INN ! INN III 

Db 12 3 9 GAGAAGAACTACCTGATTAAACAGATTCGAGACTTGAAGAAAGCAGCCCCGTGA 12 92 



RESULT 6 



AU125107 
LOCUS 

DEB' INITIOS 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
-COMMENT 



EASE C^jHT 
ORIGIN 



AU125107 822 bp mRNA linear EST 01-AUG-2002 

AU125107 NT2RM4 Homo sapiens cDNA clone NT2RM4001047 5', mRNA 
sequence . 
AU125107 

AU125107.1 GI: 10949823 
EST. 

Komo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 822) 

Ota,T., Wakamatsu, A. , Ozawa,M., Ishii,S., Saito,K., Yamamoto f J . , 
Nakamura,Y., Nishikawa.. T. , . Nagai, T . , Suzuki, Y., Sugano,S. and 
Isogai, T. 

HRI human cDNA project (Ota,T., Wakamatsu, A. , Ozawa,M., Isnn,S., 

Saito,K. Yamamoto, J. , Nakamura, Y. , Nishikawa, T. , Nagai, T., Suzuki 

,Y,, Sugano,S., Isogai, T.) 

Unpublished 

Contact: Takao Isogai 

Genomics Laboratory 

Helix Research Institute 

1532-3 Yana, Kisarazu, Chiba 292-0812, Japan 
Tel : 81-438-52-3975 
Fax: 31-438-52-3986 

■pail: jenomics@hri.co.jp 
JRI human cDNA project; 5'- & 3 » -end one pass sequencing: Helix 
Research Institute; cDNA library construction: Department of f 
Viroloqy, Institute of Medical Science, University of Tokyo, and 
•:.elix Research Institute. 

Location/Qualifiers : 
. 1. .822 '• " ; 

/organisms "Homo sapiens" 

/mol_type-"mRNA" 

/db_xref ="taxon: 9606" 

/clones "NT2RM4 001047" 

/cell_type="teratocarcinoma" 

/cell_line="NT2" 

/clone_lib= "NT2RM4 " 

/note-"Vector : pME18SFL3; mRNA from uninduced NT 2 neuronal 
precursor cells" 
268 a 164 c. 171 g 216 t 3 others 



Query Match 76.0%;, Score 770.6; DB 9 

Best :Loca|i Similarity 98.5%; Pred. No. 1.5e~156 
Matches 798; Conservative 0; Mismatches 10 



Length 822; 

Indels 2; Gaps 



2; 



07 
Db 

Qy. 
Db 

Qy 



1* TTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATCCTGAAAGACAATTTGGCC 7 8 

[ I I i t I ! I 1 I i 1 I I 1 I ! I I I I I I I I I I i I I I I I I I » I ! I I i I I I I t » I I I t I » I ' ^ ' ' ' fin 

1 TT T AGT AAATC AC AC AAAAAT C C AGC AG AAATTGTG AAAA T C CTG AAAG AC AATTTGGC C 60 
7 9 ATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAAGAAGTGTCTAAATCACTG 13 8 

' il Mill MM II 1 1 MINIUM II I II I Mill I M 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 MM 

.61 ATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAAGAAGTGTCTAAATCACTG 12 0 



13 £ CAAG 



CAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAACCCCCAACAGAAGCAGTG 



198 



I Mill II II MM II MM III MM I MUM Ml II III Mill MM 1 1 IMIMI 

Db 121 CAAGCAATgIU^ 180 

1 99 GCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTGACACTGATAGCTGACCTG 2 5 8- 

Qy MM 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III Mill I Ml II Ml I Ml 

Db isi gctcagcta^^ 240 

„.„ PAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATATTTAACAACATCTTGAGA 318 
Qy 2.9 .AGCTGATAGACTTT ,,,,,,,,,,,,,,,, | , | , , | , | | | | | | , | | | , | | | | | | | 

Db 241 CAGCTGATAGACTTTdA^ 300 

0y , 1 9 AGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGTGCTCATCCTCA^ 3 7 8 

" 1 1 1 1 1 1 1 1 1 1 1 1 M I M M M 1 1 1 1 1 ! M M II i I M 1 1 1 1 1 1 1 M II I II II 1 1 1 1 1 

-01 AGP CAGATAGGCACTCG^ 360 



Db 

Qy 

Db 

Qv 
Db 



•V 



Db 



,79 ITTATGCTCCTCAAAGGATATGAAGCGCCACAGATTGCCTTACGTTGTGGGATTATG 438 

1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 Ml 1 1 1 i 1 1 1 1 II I II I M 1 1 I Ml II 1 1 1 1 1 1 1 II 1.1 

.3*1 TTTATGCTCCTC/J^GGATA 42 °- 
,,D AGAGAATGTATTCGACATGAAGCACTTGCCAAAATCATCCTC.TTTTC 458 

1 1 U 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM II I M I II I II II 1 1 II I 

,-2;l- AGAGAATGTATTCGACATC 48D ' 
f JiTTTCTTTAAGTACGrGGAGTTGTCAACATTTGATATTGCTTCAGATGCCTTTGCTACT 558- ■ 

' " ' 1 1 1 1 1 MIMIMMMMIMMIMIMMMMIMMMMMMIMIIIIMI 

: v, rTCAAGGATTTACTAi^CCAGACATAAAGTGTTGGTAGCAGACTTCTTAGAACAAA^ 61SM= 

"• i 1 1 1 1 1 M 1 1 i 1 i 1 1 ! ! 1 1 1 1 1 1 M ! I i 1 1 ! 1 1 M 1 1 M M 1 i 1 1 M ! M ! ! 1 1 1 1 1 ! M 

TTC/WlGcJvrTTACT 600; ' 

c-* gacactatttttgaagactatgagaaattgot^ 

" M I M II 1 1 1 1 1 1 II i 1 1 II II II Ml I! 1 1 II II M I M II I! M M 1 1! 1 1 II Ml 1 1 n 

,01 GACACTATTT^ «"> 
CAGTCTTTAAAGCTGGTAGGGGAGCTGATCCTGGACCGTCACAACTTTGCCATCATC 738. 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml M 1 1 II II I II II II I II II 1 1 II II I II M I II I II I II II 

C-l CAGTCTTTA^GCTGCT 72 °' 
729 AAGTATATC^GCAAGCCGGAGAACCTGAAACTCATGATGAACCTCCT^ 798 

I I III II I II II I MM I I III Ml II M III II I II III II III II II IMM 

. AAGTATATCAGC^^GCC^ 7 ' 



7 ; :>9 CCCAACATCCAGTTTGAAGCCTTTCATGTT 82 8 

r Mill II Mill! MM Ml Ml 

7 7.9 CCCAACATCCAGTTTGAGCCTTCTGGTTTT 808 



97 
Db 

RESULT 7 

r'Sfio 735 -X393735 1201 bp mRNA linear EST 13-MAY-2003 

dSfINII-ON 3X393735 Homo sapiens NEUROBLASTOMA COT 25 -NORMALIZED Homo sapiens 

cDNA clone CS0DC002YI01 5-PRIME, mRNA sequence. 

ACCESSION BX393735 

VERSION BX393735.1 GI: 30624044 

KEYWORDS EST . 

SOURCE Homo sapiens (human) 



|3> 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



ORGANISM Homo «pi«^ chordata; Craniata ; Vertebrata; Euteleostomi ; 

Mimmalia? Eutheria; Primates; Catarrhini ; Homimdae; Homo. 
1 (bases 1 to 1201) 

hi W B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 
Contact: Genoscope 

Genoscope -Centre National de Sequencage 
•op 19 i 9100 6 SVRY cede* - France 

Email seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 

« y S constructed by Life Technologies, a division of 
I^vltrogen. This sequence belongs to sequence cluster 6951. r 
-ore information about this cluster, see 

contact " 

^^^■^^^^^ Corporation 1,00 
Paraday Avenu^Genoscope sequence ID : CS0DC0O2AE01QP1 . 
Location/Qualifiers 
1. .1201 

/organisms" Homo sapiens" 
/mol_type="mRNA" 
/ db_xr e f = " t axon : 9 6 0 6 " 

/clone="CS0DC002YI01" ■ 
/tissue type= " NEUROBLASTOMA COT 2 5 -NORMALIZED" 
/clone TiS "Homo sapiens NEUROBLASTOMA COT 2 5 -NORMALISED 
/not-'- lst strand cDNA was primed with a Nod -oligo (dT) 
primer- Five prime end enriched, double-strand ^-s 
diqested with Hot I and cloned into the Not I and EcoR V 
sites of ; -he pCMVSPORT 6 vector. Library was normalized. 
.. 34B a 223 c 239 g - 321 t 70 others , V 



FEATURES 

source 



3ASE .'COUNT 
ORIGIN 



74.0%; Score 750.8; DB 13; Length 1201; 
C-.ery Match , i co . 

Cst Local Similarity 91.0%; Pred No 3.1e-152 
Matches 766; Conservative 45; Mismatches 27, Indels 



4 ; Gaps 2 ; 



QY 
Db; 

QV 
Db 

Qy 

Db 

GY 
Db 

Qy 

Db 



x ATGAAAAAAAiGCGTTTGrrrAGTAAATCA^ ■« 

« CTGAAACACAATTTGGCCATTTTGGAAAAG^^ 120 
»1 CCCCCAAChGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACRGCAGTGGCCT(^TftGTO 240 

z eiim^ « 

A^ACTGATAGCTGACCTGCAGCTGATAGACTTTCAGGGAAAAAAAGATGTCACCCAGATA 300 



Qy 

Db 

Qy 

Db 



301 TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTACT 360 

■ iirKMOT^ 742 

a -n rr TTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCGTC 4 80: 
Qy 421 ^TTGTGGGATTATGCTG j ( ( ( ( ( ( j j , 

743 CGTTGTGGGATTATGCTC 802 

, al ttttctAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCT 540 
Qy ,81 TTTTCTAATCAAT j j , , , , , , , , , , , , , , | , , | , | | , | | | | | | | | | | | | | | | 

r 4 i TrArATGCCTTTGCTACTTTCAAGGATTTAGTAACCAGACATAAAGTGTTGGTAGCAGAC 6 00 

. 363 XCAGATGCCTTTGCTACTTTC 92 ' 2 
QY 601 TTCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGA^ 660 

JM T'Tcl'TAGAACAAAATTAC^ 982 
,, L 4A-TATCVTTACTAAGAGACAGTCTTTAAAGC 71 S 

'1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 M I M : Nihil M I INI Ml 1 1! 

b J^U^ l0 - 42 - 

,-0 ''AArTTTGCCATCATGACAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAA 77* < 

,0 iiii iUGCclK — 
... 0 ^CTCCTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGC^ B39T 



Qy 



Qy 



340 TG 841 



Db II SO TG 1161 

RESULT .8 

AK005323 • T379 bp mRNA linear HTC 05-DEC-2002. 

i;0 GuS AK00S323 cerebellum cDNA, RIKEN full-length enriched 

DEFINITION ^J^^^^^^^^ PROTEIN ho.olog IHo.O 

sapiens], full insert sequence. 
ACCESSION AK005323 

VERSION AK005323.1 01:12837793 

KEYWORDS HTC; CAP trapper. 

50UTCE Mus musculus (house mouse) 

^ORGANISM I «^ 1 - etazoa; chordata; craniata; Vertebrata; Euteleostojji; 

SSSSaT Eutheria; Rodentia; Sciurognathi; Murrdae; Murrnae; Mus. 

REFERENCE 1 , . 

AUTHORS Carninci,P. and Hayashizaki # Y . 



TITLE 
JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 

:.:.tf.nce 



TITLE 
JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHOR? 

TITLE 

JOURNAL 
REFERENCE 
. AUTHORS 



High-efficiency full-length cDNA cloning 
Meth. Enzymol. 303, 19-44 (1999) 
99279253 
10349636 

2 

Carninci,P., Shibata,Y. , Hayatsu,N., Sugah'ara , Y , , Shibata,K., 
.Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki , Y . 
Normalization and subtraction of cap-trapper-selected cDNAs to 
prepare full-length cDNA libraries for rapid discovery of new genes 
Genome Res. 10 (.10), 1617-1630 (2000) 
20499374 
1 1042159 

3hibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki, N. , Carninci , P - ,. 
Konno f H'- # Akiyama,J., Nishi,K., Kitsunai,T. f Tashiro f H. f Itoh,M. , 
Sumi,N.. Ishii,Y., (SFakamura, S . , Hazama,M. , Ni shine ,T. , Harada,A., 
Yamamoto,R., Matsumoto , H . , Sakaguchi , S . , Ikegami,T., Kashiwagi , K. , 
Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M., 
Yoneda,Y., Ishikawa,T., Ozawa f K.,. Tanaka,T., Matsuura,S., Kawai,J., 
Okazaki,Y., Muramatsu, M. , Inpue,Y., Kira,A. and Hayashizaki, Y. 
R I KEN integrated sequence analysis (RISA) system- -3 84 -format 
sequencing pipeline with 384 multi capillary sequencer 
Genome Res. 10 (11), 1757-1771 (2000) 
20530913 
11076861 



.'xx wai , J . 
Arakawa,T 
Aizawa , K. . 
oaito, T. , 
Kadota, K. , 



' £ F c '■• '■- - 

Shinagawa # A. , 3hibata,K. : , Yoshino,M., Itoti;M. , Tshii,.Y. , 
Kara, A., Fukunishi , Y . , Konno , H . , Adachi , J - , Fukuda , 3 . .- 
, 'lzawa,M., Nishi,K. # Kiyosawa,H., Kondo,S., Yamanaka , I . , 
' Okazaki, Y. , Gojobori . T . BGno,H., Kasukawa,T., Saito.-R., 
Matsuda,H. , Ashburner,M. , Batalov,S. , Casavant,T. , 
Fleischmann,W. ", Gaast'erland, f : ; Gissi , C . .... King , B . , Kochiwa,H. 
Kuehl,P., Lewis, 3., Matsuo, Y . , Nikaido, I . , Pesole,G., 
Quackenbush, J., Schriml, L.M. , . Staubli, F. , Suzuki, R. , Tomita,M., 
Wc-.gnor,L., Washio,T., Sakai,K., Okido,T: , Furuno,M., Aono,H., 
Balda.relli,R. , Barsh,G., Blake, J., Bof f el.Li , D . , Bojunga,N. ,_ 
Carninci, P., de Bonaldo, M . F . , Brownstein, M . J . , Bult,C., 
?leccher,C, Fujita,M., Gariboldi , M . , Gustincich, S . , Hill,D., 
Iofmann,M., Hume , D . A . , Kamiya,M., Lee , N . H . , Lyons , P . ; 
Marchionni,L. , Mashima,J., Mazzareili , J. , Mombaerts , P . , Nordone,P., 
Ring,B., Ringwald,M., Rodriguez , I . , Sakamoto, N., Sasaki, H., 
3ato,K., 3chonbach,C. , Seya,T., Shibata,Y., Storch,K.F., Suzuki, H. , 
Toyo-oka, K. , Wang,K.H., Weitz.C, Whittaker , C . , Wilming,L., 
Wynshaw-Boris,A. , Yoshida,K., Hasegawa,Y., Kawaji,H., Kohtsuki,S. 
and Hayashizaki , Y . 

Functional annotation of a full-length mouse cDNA collection 

Mature 4 09 (6321), 685-690 (2001) 

21085660 

11217851 

The FANTOM Consortium and the RIKEN Genome Exploration Research 
Group Phase I & II Team. 

Analysis of the mouse transcr i.ptome based on functional annotation 
of 60,770 full-length cDNAs 
Nature 420, 563-573 (2002) 
.6 (bases 1 to 1379) 

Adachi, J. , Aizawa, K. , Akahira,S., Akimura,T., Arai,A., Aono,II., 



JOURNAL 



COMMENT 



Arakawa,T., Bono,H., Carninci,P-, Fukuda,S., Fukunishi , Y . , 
Furuno.M., Hanagaki.T., Hara,A., Hayatsu,N., Hiramoto,K., 
Hiraoka.T.. HOri.P.. Imotani.K., Ishii,Y., Itoh.M., Izawa.M.. 
Kasukawa,T., Kato.H., Kawai,J., Kojima,Y., Konno,H., Kouda,M., 
Koya S , Kurihara,C, Matsuyama,T. , Miyazaki,A., Nishi,K., 
Nomura, K., Numazaki, R. . Ohno,M. ( Okazaki,Y., Okido,T. ( Owa,C, 
Saitc.H., Sa:ito,R., Sakai.C, Sakai,K., Sano,H., Sasaki, D. , 
n'hibata.K., Shibata,Y., Shinagawa, A. , Shiraki,T., Sogabe,Y., 
Suzuki, H., Tagami,M., Tagawa,A., Takahashi , F . , Tanaka,T., 
Tejima,Y., Toya,T., Yamamura , T . , Yasunishi, A. , Yoshida,K., 
Yoshino,M., Muraraatsu,M. and Hayashizaki , Y . 

Direct Submission , ' 

Submitted (10- JUL-2000) Yoshihide Hayashizaki, The Institute oj. 
Physical and Chemical Research (RIKEN), Laboratory for Genome 
exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute ; 1 - 7 -22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail -.genome-resOgsc . nken . go . jp, 
URL: ht-tp: //genome, gsc.riken.go.jp/, Tel : 81-45-503-9222 , 
Fax:81-45-503-9216) . 

Please visit our web site (http://genbme.gsc.r1ken.go.3P/rtor . . , f ^, 

further details. 

cDNA library was prepared and sequenced in Mouse Genome 
encyclopedia Project of Genome Exploration Research Group in Riken • . 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
•—pare mouse tissues. First strand cDNA was primed with a primer y , . 

GAGAGAGAGAAGGATCCAAGAGCTCTTTTTTTTTTTTTTTTW 3 • ] , cDNA was 
'.'-•■pared by using trehalose thermo- activated reverse transcriptase p.v. .J 
•nd subsequently enriched for full-length by cap-trapper. Second . | 

; , , and cDNA was prepared with the primer adapter of sequence [5 V 
'lAGAGAGAGAGCGGCCGCAATTAATTCTCGAGTTAATTAAATTAATCCCCCCCCCCC 3 ] . cDNA" 

was cleaved with Xhol and Sst.I. Cloning sites, 5' end: Xhol; 3' , J 

end: Sstl. Host: SOLR. $■ 
Location/Qualifiers v - . • -fc 

..? ' 1- .1379 

/organism="Mus musculus" 

/mol_type="mRNA" 

/strain="C57BL/6J" 

/db_xref = " FANT0M_DB : 1500 03 1K13 " 

/db_xref ="MGI : 1901050" . 
/db_xref="taxon: 10090" 
/clone="1500031K13" 
/sex= "male" 

/ t i s s ue_ type =" c e rebe 1 1 urn " 

/clone lib= "RIKEN full-length enriched mouse cDNA library 
/dev stage = " adul t " 

/note="unnamed protein product; M025-LIKE PROTEIN homolog 
[Homo sapiens] (SWISSPROT | Q9H9S4 , evidence: FASTY, 
98.2%ID, 100%length, match=1002) 
putative" 
/codon_start=l 
/protein_id="BAB23953.2" 
/db_xref="GI: 2 6342524" 
/db xref="MGI: 1916258" 

' trans lation="MKKMPLFSKSHKNPAEIVKILKDNLAILEKQDKKTDKASEEVSK 
PLQAMKE I LCGTNDKE P PTEAVAQLAQELYS SGLLVTL I ADLQL I DFEGKKD VTQ I FN 



Db 



NILRRQIGTRCPTVEYISSHPHILFMLLKGYEAPQIALRCGIMLRECIRHEPLAKIIL 
FSNQFRDFF 

SENYVTKRQSLKLLGELILDRHNFTIMTKYISKPENLKLMMNLLRDKSPNIQFEAFHV 
FKNSVFITNRIHGLKRWLSS " 

polyA_siqnal 1361. .13 66 

/note= "putative" 

polvA__si:te 1379 

/note= "putative" 

BASE COUNT 452 a 284 c 291 g 352 t 

ORIGIN 

Ouery'Mntch 70.0%; Score 709.4; DB 11; Length 1379; 

~ ^ j'r ^ q-imi Taritv 88.9%;. Pred. No. 3e-143; 

Sc^ ;^ Conservative 0; Mismatches 96; mdels 0; Gaps 0, 
1 ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAG€AGAAATTGTGA^ATC 60 

Qy 1 M?tmtttlMlllllllllllli|l|ll!IIIIIIIMIillllllllll INN 

235 ATGAAAAAAATGCCTTTC 344 

--, r-TC a a ar ACAA.TTTGGCGATTT1 GGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 12 0 : 
Qy , ,1. CTGAAAGACAATTTGGCCA ,,,,,,,,,,,,,,, | ,., | | | | , | | | , | | | | | | | | | | 

345 .UgaUgac^ 404 

GAAGTGTCTAAATCACTGCAAGCAATGAAAG^ 
,,, GAGGTGTCAAAACCTCTGCAAGC 464 

• 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 !M i i II 1 MM M I M I hi 

i5 5 CCCCCTACAGAAGCAGTC 524 
'*■'•> 1 ACACTGATAGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTG^ 30O 

. — 1 , T i T i 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 I II Mill Ml II MM MMMMIMMMM 

301 TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTC 360 

II MINIMI I II 1 1 1 1 1 1 II II II I MM Mill M Ml 

r,»5 TTCAACAACATCCTGAGAAGA^ 644 
- ,-i rrTCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 420 

I III MM Mill II II MUM I II 1 1 1 M I II I II II I II II II II 1 1 M II ! nA 

f)45 t^CATCCTCAC^ 704 
,21 CGTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATC 480 

ii 1 1 ill Ml III III Mill I Mill II MM- - Ml Mill Mill Ml III 

7C-5 CGCTGTGGGATTATGCTAAGAGAGTC 764 
481 TTTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTC 540 

M II ll I II 1 1 1 1 1 1 II I II M I M Ml II II M II M M 1 1 III 

7(J5 TTTTCTAATCAGTTCAGAGATTTC 824 
■■■i TCAGATGCCTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGA.C 500, 

' ' TlilMIMM II Mill I MIIIMI MUM MM III III I Mill Ml II I 

825 TCAGATGCCTTCGCTAOT 884 
601 TTCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTOAG 660 
I I I I I I I MM I II I M I I M M I I I II M M II I II M II M I MM M I 1 1 1 1 1 



Db 



Qy 

Db 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



885 TTCTTAGAACAAAATTATGACACTATTTTTGAAGACTATGAGAAACTGCTGCAATCTGAG 944 

661 AATTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTC 720 

II || I II II I I I I I I I I I I I I I I 1 M I I 1 1 I I I 1 I I III 

•945 p^CTATGTGACAAAGAGACAAl'CTTT 1004 

721 AACTTTGCCATCATGACAAAGTATATCAGCAAGCCGGAGAACCTGAAA 780 

I i II 1 1 I I Mill I I I I I I I I 1 1 1 1 I I I 1 1 1 1 1 1 1 I 

1005 AATTTCACCATTATGACC^GTATATCA^ 1064 
78 1 CTCCTTCGGGATAAAAGTCCGAACATCCAGTTTGAA 840 . 

I I i i;i I II 1 1 1 1 II I II r| || I Mill M M 1 1 I I 

1065 CTGCTT<:GAGAC^^ 1124 
84i GTGGCCAGTCCTCACAAAACACA 863 

" 11 III I I Mill 

112 5 GTCTTTATTACAAATAGAATACA 1147 



RESULT 3 
BG218735 
LOCUS 

DEFINITION 
ACCESSION 
;/EPSION 
vCi:7W0RDS 

::;;. ; ;>CE 

:> DANISM 



;:9-ERENCis 
AUTHORS 



TITLE 

MEDLINE 
PUBMED 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini ; Hominidae ,- Homo . 



FEATURES 



source: 



Rr?18735 784 bp mRNA linear EST 21-APR-2001 

R3T38476 Athersys RAGE Library Homo sapiens cDNA, mRNA sequence. 
3G218735 

3G218735.1 GI: 13744756 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryoca; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
; (bases 1 to 784) ,: 
■iarrinqton,J.J., Sherf.B., Rundlett , S . , Jackson, P. D. , Perry, R, 

9 L^nfnal.C. Thornton, M-, Ramachandran , R . , Whittington, J. 
, Costanzo.D., McElligott,K., Boo,er,S., Mays R. Smxth 
iE ielosoV, Klika,A., Hess, J. , Cothren,K., Lo.K,; Offenbacher 

J Danziq, J. and Ducar,M. , 
Cation or genome -wide protein expression libraries using, random 

activation of gene expression 

Nat. Biotechnol. 19 (5), 440-445 (2001) 

21227151 

11329013 

Contact: Scott J . Cain 
Athersys, Inc. 

3201 Carnegie Ave, Cleveland, OH 44115, USA 
Tel: 216 431 9900 
Fax: 216 361 9596 
Email : scain@athersys . com 
High quality sequence stop: 515. 
Location/Qualifiers 

1. .784 

/organism="Homo sapiens" 
/mol_type=*'mRNA" 
/db_xref ="taxon: 9606" 
/celi_line="HT1080" 
/clone lib= "Athersys RAGE Library" 

/note=»See 'Creation of Genome-wide Protein Expression 
Libraries using Random Activation of Gene Expression', 



Nature Biotechnology., in press. Note that even though the 
cell type indicated is HT1080, since a random activation 
method was used, these sequence tags are not necessarily 
expressed in HT1080 under normal circumstances." 

BASE COUNT 265 a 151 c 158 g 209 t 1 others 

ORIGIN 

Query Watch. 66.3%; Score 671.8; DB 10; Length 784; 

Best Local Similarity 96.5%; Pred. No. 3.8e-135; 

Matches, 713; Conservative 0; Mismatches 23; Indels 3; Gaps 3; 

1 ^TGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATC 60 

" MINI! II iMiMliliMMtlllilillMllllllllililll Mill 

Db 32 ATGAAAAGAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGTAAATC 91 

... PTGAAAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 120 

MililllllliillililllllllMllllillllllilltllllllilllllllllll 

Db o 2 CTGAAAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 151 

GAAGTGTCTAAilTCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA 180. 

iiiilllllllllllijillllllMIIIIIIIIIIMIIIIilllllllMIIIIIIII 

Db 152 GAAGTGTCTAAATCACTGCAAGGAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA 21* 



Qy 



lih 



Qy 



!.iO 



• • i CCCCGAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 240^ 

MM Mil II MM 1 1 MM IMIMIIMMilMIMMMMMMIMMIMM 

:■: 2 CCCCCAACAGAAGCAGTGGCCCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGGTAGTG 271- 
-■^ACTGATAaCTGACGTGCAGC-rGATAGACTTTGAGGGAAAAAAAGATGTGACeCAGATA 3 00, 

Vi||!|||!iii|||ii|||MI!lllilllillll!:l!MMIil IMIIMM 

V. :v: aCACTGATAGCTGACGTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATA. 3 3,1. 

• ' TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATAtTAGT 3 6T>. 

MMlilMMMMMMIMMMMMIMMMIMMMMMMMMMMM ■ 

3 TTTAACAACATCTTGAGAAGAGAGATAGGCACTCG3AGTCCTACTGTGGAGTATATTAGT 3 91- 
- "1 GCTCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGAtTGCCTTA 4 2.0 

" IMIMIMMMMIMMMIMIMMMIMMIMMMMMMMIMM II 

3 32 3CTCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCCTA 4 5-1 
a 2 1 CGTTC'fGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATGCTC 4 8 0 

" 1 1 1 1 1 1 1| 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 

■x$ ■> GGTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTC 5 1 1- 



, B1 •fTTTCTAATCAATTGAGAGATTTCTT'TAAGTACGTGGAGTTGTCAACATTTGATATTGCT 540" 

■ 1 1 1 1 1 II i M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II ' i M i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

B 1 2 TTTTCTAATC AATTCAGAG ATTTCTTTA AGTACGTGGAGTTGTC AACATTTGATATTGCT 571 



, n TCAGATGC CTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGAC 60 0' 

QY " mm ill II MM II MM II Ml 1 1 llllll MM M ill I MM I M III III Mill 

b 572 TCAGATGCCTTTGCTACTTTCAAGGATTTAGTAACCAGACATAAAGTGTTGGTAGCAGAC 631 



fw 6P1 TTCTTAGAACAAAATTACGACACTATTTTTGAAGACTATG-AGAAATTGCTTCAGTCTGA 659 

Q -' "• Mini I Ml I I MM II Mill II I llllll 

Db 632 TTCTTAAAl 



CAAAATTACGACACTATTTTTTGAAGACTATGAAGAAATTGCTTCAGTCTGA 691- 



( 60- GAATTATGTTAC - TAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTC 718 

Q/ ■ TmTimmim MMMMIMIMM MMMMIMMMMMIMI I •! I 



Db 
Qy 
Db 



692 GAATTATGTTACTTAAGAGACAGTCTTTAGAGCTGCTAGGGGAGCTGATCCTGAAANGTT 751 



71 o AC AACTTTG C C ATC ATG AC AAAGT 7 42 

" I HI II I II 1 1 NMI I HUM 

7 52 ACAACTTTGCC - TCATGACAAAGT 774 



RESULT 10 

AK013161 

LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
. ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
•JOURNAL 
MEDLINE 
.PUBMED 

T -.Ei r ERENCE 



Journal 

MIDLINE 
... I-'uBMED 
^.Ll v ERENCE 
.^.UTHORS 



TITiiE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 



Craniata; Vertebrata; Euteleostomi ; 
Sciurognathi; Kuridae; Murinae; Mus . 



1281 bp mRNA linear HTC 05-DEC-2002 

AK ° 13161 i a 10 11 days embryo whole body cDNA, RIKEN full-length 
Mus musculus 10, H aays emuiyu wx j protein 

enriched library, clone : 2 810425013 product = M025 -LIKE PROTEIN 
homolog [Homo sapiens] , full insert sequence. 
AK013151 

AK013161.1 GI:12850350 
HTC; CAP trapper . 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodentia; 
1 

Carniiici,P. and Hayashizaki , Y . 
High-efficiency full-length cDNA clonxng 
Meth: Enzymol. 303, 19-44 (1999) 
99279253 
1034 963 6 

-r-iinci P Shibata,Y.; Hayatsu,N., Sugahara,Y., Shibata,R:^r 
...o.rnxncx, h. , rsi*m>\ v Murama h su M . and- Hayashxzaki , Y . 

Genome Res. 10 (10), 1617-1630 (2000) ; ' 

20499374 

110421S9 .. „■'• 

^•k ^ v Ttoh M Aizawa,K., Nagaoka,S., Sasaki, N. , Carninci, P. , 
Shibata,K. Itoh.rt., Aiza , a Tashiro,H., Itoh.M., 

r^i H -'iSuT'^^SSi:5:; ££EV.. 
^t;/ R 8hi ;™o,H., sa ri ,s It i T ■£335:$:; 

_ jr TArraw^ Y Tzawa,M. , OUara,ri . , WdLaniJvx,i , 

Okazak Y Muramatsu , M . , Inoue,Y., Kira,A. .and Hayashxzaki Y. 
R I KEN integrated sequence analysis (R1SA) system- -3 84 -format- 
sequencing 9 pipeline with 384 multicapillary sequencer 
Genome Res. 10 (11), 1757-1771 (2000) 
20530913 
11076861 
4 

Kawai , J - , 
Arakawa , T 
Aizawa, K. 
Saito,T\ , 
Kadota, K. 



Shinagawa,A., Shibata,K., Yoshino,M Itoh.M. , lehii. Y. , 
Hara,A., Fukunishi,Y. , Konno,H., Adachi.J.. Fukuda,S., 
'izawa M. Nishi.K., Kiyosawa,H., Kondo, S . , kanaka,!. . 
Okazaki,Y., Gojobori,T., Bono.H., Kasukawa,T. , Saito,R. < 
Kadota,K., Matsuda,H A~ Batalov S Casavant.T. ^ 

l^7^sZ 5 ~^*^l'> Pelole'.C. 
Quack^sh?"! Schriml.L.M., Staubli,F., Suzuki, R. , Tomxta,M., 
Sagner?L ! Washio.T., Sakai,K., Okido,T., Furuuo,M., Aono,H. , 



TITLE 
JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
REFERENCE 
AUTHORS 



.x :;p.na.l 



COMMENT 



Baldarelli,R. , Barsh,G., Blake, J., Boffelli,D., Bojunga f N. f 
Carninci,P., de Bonaldo, M . F . , Brownstein , M . J. , Bult,C, 
Fletcher, C, Fujita,M., Gariboldi , M - , Gustincich, 3 . , Hill,D., 
Hofmann,M. , Hume,D.A., Kamiya,M., Lee,N.H., Lyons, P., 
Marchionni,L., Mashima, J . , Mazzarelli , J . , Mombaerts, P. , Nordone,P., 
Ring B., Ringwald,M., Rodriguez , I . , Sakamoto, N., Sasaki, H., 
Sato,K., Schonbach,C, Seya,T., Shibata,Y., Storch,K.F., Suzuki, H. , 
Toyo-oka,K., Wang,K.H., Weitz,C, Whittaker , C . , Wilming,L., 
Wynshaw-Boris,A., Yoshida,K., Hasegawa # Y. f Kawaji,H., Kohtsuki,S. 
and Hayashizaki, Y. 

Functional annotation of a full-length mouse cDNA collection 
Nature 409 (6821), 685-690 (2001) 
21085660 
11217851 

The FANTOM Consortium and the RIKEN Genome Exploration Research 
Group Phase I & II Team/ 

Analysis of the mouse transcriptome based on functional annotation 
of 60,770 full-length cDNAs 
Mature 420, 563-573 (2002) 
6 (bases 1 to 1281) 

Adachi,J., Aizawa,K., Akahira,S., Akimura,T., Aral, A. f Aono,H., 
Arakawa,T., Bono,H., Carninci,P., Fukuda,S., Fukunishi , Y . , 
Furuno,M., Hanagaki,T., Hara,A., Hayatsu,N., Hiramoto,K., . 
,'iraoka,T., Hori , F . , Imotani,K., Ishii , Y . , Itoh, M - , Izawa,M. , 
.lasukawa,T., Kato,H., Kawai,J., Kojima,Y., Konno,H., Kouda . M . , 
= .oya,S., Kurihara,C, Matsuyama , T . , Miyazaki , A. , Nishi,K., 
Nomura , K . , Numazaki , R . , Ohno , M .. , Okazaki , Y . , Okido , T . , Owa , C . , 
3aito,F., Saito,R., Sakai,C, Sakai,K., Sano,H., Sasaki, D., 
;>tiifoata,K*, Shibata,Y. ? Shinagawa, A. , Shiraki, T. , Sogabe, Y. , 
,'izuki,H., Tagami,M. , Tagawa,A., Takahashi , F . , Tanaka,T., 
Tej ima, Y . , Toya , T . , Yamamur a , T . , Yasuni shi , A . , 
YoshiiiO .M., Muramatsu,M. and Hayashizaki , Y . 
Direct Submission 

Submitted (10- JUL-2000) Yoshihide Hayashizaki , 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail : genome -resOgsc .riken.go.jp, 
URL: http://genome.gsc -riken.go.jp/, Tel : 81 -45 -503 - 9222 , 
Fax:81-45-503-9216).. , 
Please visit our web site (http://genome.gsc.riken.goop/.) for 

further details. 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken . 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. First strand cDNA was primed with a primer . 
[5' GAGAGAGAGAAGGATCCAAGAGCTCTTTTTTTTTTTTTTTTVN 3 • ] , cDNAwas 
-prepared by using trehalose thermo-activated reverse transcriptase 
Ind subsequently enriched for full-length by cap-trapper. cDNA went- 
through one round of normalization to Rot = 7.5 and subtraction to 
Rot = 37.5. Second strand cDNA was prepared with the primer adapter 

of sequence [5' ' 
GAGAGAGAGATTCTCGAGTTAATTAAATTAATCCCCCCCCCCCCC 3']. cDNA was cleaved 

with Xhol and Sstl. Cloning sites, 5' end: Xhol; 3' end: Sstl. 
Host: SOLR. 



Yoshida, K. , 



The institute, of 



FEATURES Location/Qualifiers 
source 1- .12 81 

/organism="Mus musculus" 
/mol_type="mRNA" 
/strain="C57BL/6J" 
/db xref ="FANTOM_DB :2810425O13" 
/db~xref="MGI: 1908997" 
/db_xref = "taxon : 10090 " 
/clone=" 2 810425013" 
/tissue_type= "whole body" 

/clone_lib="RIKEN full-length enriched mouse cDNA library 
/dev_stage="10, 11 days embryo" 

<n\r.c feature 285. .1096 . 

/ no t-e="M025-LIKE PROTEIN homolog [Homo sapiensj 
(SWISSPROT|Q9H9S4, evidence: FASTY, 98.2%ID, 100%length, 
match=1002) 
putative" 

/db_xref="MGI: 1922871" 
BASE COUNT 333 a 275 c 291 g 332 t 

ORIGIN • - 

Query Match 61.4%; Score 622.8; DB 11; Length 1281; 

Best Local Similarity 82.8%; Pred. No . 1 . 7e-124 ; 

•:ktch«8 76* i Conservative 0; Mismatches 97; Indels 62 , Gaps 2, 



ACtACAAAAAGACAGACAAGGCTTCAGAAGAAGTGTCTAAATCACTGCAAGCAATGAAAGA 15*. 

'ill i ( II j I i | i I i 1 1 1 1 1 1 1 1 1 1 1 1 MMI II I M M II I M M M 

,V, 237 UaAGACAGGATTTCTAAGGC-TCAGAAGAGGTGTCAAAATCTCTGCAAGCAATGAAGGA 296 

^CTCTGTGTGGTACA^CGAGAAAGAACCCCCAACAG 212 

'm i MINIMI II Mlli 'II i I 1 1 1 1 1 MMIIIIIMIillMil! II H 
A: - 5 : Aattctgtgtggaac<4cgacaaggagccccctacagaagc 

• - 3 agaACTCTACAGCAGTGGCCTGCTA^GTGACACTGATAGCTGACCTGCAGCTGATAGACTT 2 72-: 

. / In Ml 1 1 lli II UN I II ! 1 1 1 1 IIIIIIIMMMMM llllllll 
m ,yn ggagctctacagcagcgggttgct 416 " 

, 73 TG AGGGAAAAAAAGATGTGACCCAGATATTTA^ 332 

M I Mi II I III Ml ill I II Mill M MM Ml IMIMMMIM M I! 

417 TGAGC^AAAAAAAGATGTGAC^ 476 
-33 TCGGAGTCCTACTGTGGAGTATATTAGTGCTCATCCTCATATCCTGTTTATGCTCCTCAA 3 92 

l?T Mil I 111 II MMI II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M I II 1 1 1 1 1 1 Mill 

4-77 ACGGTGTCCTACTGTCGAGTACATCA^ 536 
1 3 . AGGATATGAAGCCCCACAGATTGCCTTACGTTGTGGGATTATGCTGAGAGAATGTATTCG 452 

II III MMI I Mill I M Ml Mlli I Mill MM Mill Mill llllllll 

537 AGGCTATGAAGCCCCACAGATTC 596 
453 ACATGAACCACTTGCCAAAATCATCCTCTTTTCTAATCAATTCAGAGATTTC 512 

I Mill MUM IMIMMMIM I Ml II MM I II II II I Mill 

,37 ACATGAGCCACTTGCCAAA^ 656 
513 CGTGGAGTTGTCAACATTTGATATTGCTTCAGATGCCTTTGCTACTTTCAAGGATTTACT 572 

II III MM II llllllll II M M II II I II I I IIIH I Nil 

657 TGTTGAGCTGTCCACCTTTGATATCGCTTCAGATGCCTTCGCTACTTTTAAG '- - 708 



rib 

Qy 

Db 

Qy 
Db 

Qy 

Db 



Qy 

Db 

Qy 

Bb 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 



i. -D 



57 3 AACCAGACATAAAGTGTTGGTAGCAGACTTCTTAGAACAAAATTACGACACTATT 632 

, jr)Ci ATTTTTGA 716 

63 3 AGACTATGAGAAATTGCTTCAGTCTGAGAATTATGTTACTAAGAGACAGTCTTTAAAGCT 6 92 

I MINIMI II I MM II I II I MM Mill II IIIIIMI MM III II I 

717 AGACTATGAGAAACTGCTGCAATCTGAGAACTATGTGACAAAGAGACAATCTTTAAAGTT 77 6 
693 GCTAGGGGAGCTGATCCTGGACCGTCACAACTTTGCCATCATGACAAAGTATATCAGCAA 752 

• T MUM Mill III Ml III II I I MM II MM Mill III IIIIIMI I 

777 GCTAGGTGAGCTGATCCTGGACCGCCACAATTTCACCATTATGACCAAG- -TATCAGCAA 834 
7<v3 GCCGGAGAACCTGAAACTCATGATGAACCTCCTTCGGGATAAAAGTCCCAACATCCAGTT 812 

Ml IIIIIMI MINI Mini II I II 1 1 1 1 1 M KIM I IIIIIMI I II II 

3 3 5 GCCAGAGAACCTGAAACTGATGATGAACCTGCTTCGAGACAAAAGTCCCAACATCCAATT 894- 
813 TGAAGCCTTTCATGTTTTTAAGGTGTTTGTGGCCAGTCCTGACAAAACACAGCCTATTGT 872 

MINIM INN INN II I INI II INN I N N I II II I 1 1 INN I N 

3 95 CGAAGCCTTCCATGTCTTTAAGGTGTTTGTGGCCAGCCCCCACAAAACGCAGCCTATCGT 954 
373 GGAGATCCTGTTAAAAAATCAGCCCAAACTCATTGAGTTTCTGAGCAGCTTCGAAAAAGA 93 2 

\\\\\\ - I I ■ - I i 1 1 1 I I 

95 5 GGAGATTCTGTTAAAAAATCAGCCCAAACTCATTGAGTTTCTGAGCAGCTTTCAGAAAGA 1014 
"3 J AAO-GACGGATGATGAGCAGTTCGCTGACGAGAAGAACTACTTGATTAAACAGATCCGAGA 9 92 

i INN II- II II IN I II KM I II MM II II III 1 1 1 1 1 i E 1 1 1 IN INN _ 

i.Ol 5 AAGGACAGACG ACGAGCAGTTTGCTGACOAGA AGAACTACCTGATTAA AC AG AT.TCGAG A 1074 
<\q-$ r^TGAAGAAAACGGCCCCTTGA 1014 

III INI IN I INN III 

107 5 CTTG AAG AAAGC AG C C CCGTG A 1096 ' 



result 

BU11652^ 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



3U116522 951 bp mRNA' linear EST 25 -NOV- 2 002 

603139786F1 CSEQCHL15 Gallus gallus cDNA clone ChEST12 912.2 5 ' ; mRNA 
sequence . 

/./U116522 ' ; 

BU116522.1 GI: 25323402 

EST. 

Gallus gallus (chicken) 
Gallus gallus 

Sukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Archosauria; Aves; Neognathae; Gallif ormes ; Phasianidae; 
i?hasianinae; Gallus. 
1 (bases .1 to 951) 

Boardman,P.E. , Sanz-Ezquerro, J . , Overton , I . M . , Burt,D.W., Bosch, E., 

Fong,W.T., Tickle, C, Brown , W . R . A . , Wilson, S . A . and Hubbard, S . J. 

A Comprehensive Collection of Chicken cDNAs 

Curr. Biol. 12 (22), 1965-1969 (2002) 

22335534 

12445392 

Contact: Simon Hubbard 

Department of Biomolecular Sciences 

University of Manchester Institute of Science and Technology -(UMIST 
) 



PO Box 88, Manchester, M6 0 1QD, UK 
Tel: 01612008930 
Fax: 01612360409 

Email : Simon.Hubbard@umist .ac.uk. 
FEATURES Location/Qualifiers 
source 1- -951 

/organism="Gallus gallus" 

/mol_type="mRNA" 

/strain="Compton Line 151" 

/db_xref ="taxon: 9031" 

/clone="ChEST129122" 

/sex=" Female 11 

/tissue_ type=" cerebrum" 

/ de v_s t age = " adu 1 t " 

/lab_host="DH10B" 

/clone_l ib= " CSEQCHL15 " 

/note- "Organ: brain; Vector: pBluescript II KS (+) ; Site_l: 
EcoRI; Site_2: Notl; Modification of pBluescript II KS (+) 

[Stratagene] vector to accommodate cDNA produced with the 
T- trimmed protocol (Construction of uni -directionally 
cloned cDNA libraries from messenger RNA for improved 3 1 
end DNA sequencing by Glenn Fu, et al . U.S. Patent # 6,387 

,62.4) . Cut pBluescript II KS( + ) with Notl and EcoRI. 
Ligate in double stranded adaptor containing Bsgl and 
BamHI sites [5 ' ggccgcgtgcagccccggatccgaaaaaaag] 

[5 ■aattctttttttcggatccggggctgcacgc] " 

r\A& ...:ou:r,: 303 a :"*os c 202 g 240 t; <" T 

ORTGM 

Query MaiuVh 60.6%; Score 614; DB 13; Length 951; >■ t. :> 

K^t Local Similarity 32.8%; Pred. No. 1.3e-122; • . 

M^iuKtefi 737; Conservative 0; Mismatches 150;, Indels. 3; ;Gaps - .; 3; 

• 21 GAAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA 130 

MIMIII Mill IMMMIMMM 1 1 1 1 1 i 1 1 II 1 1 1 II I' II i I II 

6 GAAGTGTCAAAATCTCTGCAAGCAATGAAGTAAATTCTGTGTGGGACCACAGACAAGGAG 65 



Qy 
Db 



91 CCCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 240 

" 1 1 || I i 1 1 1 1 1 i 1 1 1 1 II II 11 i 1 1 1 1 1 1 1 1 1 I Ml I II II I II M II II II 

66 CCACCGACAGAAGTAGTGGCTCAGCTGGCACAAGAATTGTACAACAGTGGCCTTCTAGTG 125; 
^41 acACTGATAGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATA 3 00 

Mill M M IMMIIMI Mill MIIMM Mill Mill MIMIII 

Db 126 acaCTT aTTGCCAACCTGCAGCTCATAGATTTTGAGGGTAAAAAGGATGTTTCCCAGATA 185 

nv «oi ^TTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGT 3 60 

Qy ' " T 1 1 1 1 1 1 1 1 1 i I IMMIIMI II Mill MIIIIMI II 1 1 liM ^ 

Db^ 18 6 TTTAACAACATCCTGAGAAGACAAATTGGCACACGAAGCCCTACTGTGGAATACATTAGT 24 5 

GCTCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 42 0 

u/ '. " \\ II II I II III I II II Mil 1111 I ! 1 1 1 II! 

Db - : . 46 GCCCATCCACATATCCTGTTCATGCTTCTGAAAGGCTATGAATCCCCAAATATTGCCTTA 3 05 

nv 42 i CGTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAA - AATCATCCT 47 9 

QY M Mill MIMIIIMI II II II IIMIIIIIIII MINI M 

Db SO 6 CGCTGTGGAATTATGCTGAGGGAGTGCATCCGACATGAACCATTGGCCAACAATCATAC j. 36 5 



Qy 4 80 CTTTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGC 53 9 

1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I ! 1 1 II 1 1 1 1 1 1 1 1 1 1 ! ! 1 1 1 1 1 1 1 ! 1 1 

Db 3 66 TTTTTCAGAACAGTTCAGAGACTTCTTCAAGTATGTGGAAATGTCAACATTTGATATAGC 4 25 

Qy S4Q TTCAGATGCCTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGA 5 99 

I! 1 1 1! Ill II 1 1 II MM II II I II MM M II II 1 1 1 1 i i 1 1 1 M 1 1 

Db 426 ATCTGATGCCTTTGCTACATTCAAGGACTTGTTAACAAGGCACAAGTTGTTGGTAGCAGA 4 85 

q v 600 CTTCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGA 659 

II I I Ml Mill II II II II Mill II Mill III I Mill Mill 

D b .4 3 6 , TTTTATGGAACAAAATTATGAT ACGATCTTTGAGGATTATGAAAAACTCCTTCATTCTGA 5 4 5 

Qy s6G GAATTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCA 719 

MUM M II I II Ml Ml Mill MM MM 1 1 II MM M III I II 

Db iA6 GAATTACGTAACAAAGAGACAGTCTTTGAAGCTGCTGGGTGAATTGATTCTAGACAGACA 6 05 

Qy 72 0 CAACTTTGCCATCATGACAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAA 779 

Ml Ml I III II I III II II II I Mill III II Mill Mill II MM! Ml 

Db 6 36 CAACTTCGCCATCATGACAAAATATATCAGCAAACCAGAGAATCTGAAGCTGATGATGAA S6S 

Qy 7 80 CCTCCTTCGGGATAAAAGTCCCAAGATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTT 83 9 

i I M M II MM! Ill II II I II IIMIMI II Mill II Mill M 

S,b ■ :&S,6 CTTGCTGCGAGAC.AAAAGCCCCAACATTCAATTTGAAGCATTCCATGTGTTCAAGGTTTT 72 5 

?10 ' !1 GTGGCCAGTCCTCACAAAACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCC7VA 8 99 

1 1 M i I 1 1 M 1 Ml II II Mill II MM I I MM I i' I Mill III MM 

■ 7;: 6 TGTGGCCAGTCCAAAC AAAACTCAGCCCATCGTGGAGATCCTGCTGAA AAACCAG - CCAA 7 84, 
- -) 0 0 -iCTCATTGAGTTTCTGAGCAGCTTCCA - AAAAGAAAGGACGGATGATGAGCAGTTCGCTG 9 5 8 

IMM IMMMIMM IMM III II IMIMI M! IMMMM I I 

Dh, 7. 8 5 GCTCATCGAGTTTCTGAGCCATTTCCAGAAACGAGAGGACGGTTGACGAGCAGTTCACCG 844 

Qv '45 9 P- CG A^ AAGA AC T AC TTG AT T. AAAC AG AT C CG AG ACTTG AAG AA AACGG C C 10.0 8 

IMM II I M M M MM M II I II II I i II I II M M I ill 

Db 045 ACGAG AAGAACT AC CTG ATC AAG CAAATC CGAGA CTTG A AG AAGGC CG AC 3 94 



RESULT 12 
r.()66 9953 
LCCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



BQ669953 .. 982 bp inRNA linear EST 15-JUL-2002 

AGENCOURT_82 03 755 NIH_MGC_102 Homo sapiens CDNA clone IMAGE : 6255924 
5 1 mRNA sequence. 
BQ669953 

3Q669953 -1 31:21780787 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 982) 

NIH-MGC htup://mgc.nci .nih.gov/ . 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail , nih.gov 
Tissue Procurement: ATCC 
cDNA Library Preparation: Rubin Laboratory 



cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DMA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/ LLNL at: 
http://image.llnl.gov 
Plate: LLCM24 07 row: m column: 13 
High quality sequence stop: 508. 
Location/Qualifiers 

e 1. .982 

/organism="Homo sapiens" 
/mol_type= "mRNA" 
/ db_xr e f = "taxon:9606" 
/clone= " IMAGE : 6255924 " 

/tissue type=" epidermoid carcinoma, cell line" 
/lab_host="DH10B (phage-resistant) " 
/clone lib="NIH_MGC_102" 

/iiote="Organ: salivary gland; Vector: pOTBV; Site_l : Xfiol; 
Site 2- EcoRI; cDNA made by oligo-dT priming. 
Directiohally cloned into EcoRl/XhoI sites using the 
following 5' adaptor: GGCACGAG(G). Library constructed 
by Ling Hong in the laboratory of Gerald M . Rubin 
(University of California, Berkeley) using ZAP-cDNA 
synthesis kit (Stratagene) and Superscript II RT (Life 
Technologies). Note: this is a NIH_MGC Library." 
217 a 200 C 357 g 197 t 11 others ..- . 

58.6%; Score 594.2; DB 13; Length 982; • • 

al B-.nilarity 96.5%; Pred. No.. 2,6e-118; - 
K3; conservative 0; Mismatches 20; Indels 3; ifcap.s: 2; 

334 CGGAGTCCTACTGTGGAGTATATTAGTGCTCATCCTC 393 

Mlill'illillMIM IIIIIIIIIIIIMIIIIIIMIMI lllll.llhlll.il. 
i cggagtcctactgtggAg-atatt ^ 

394 GGATATGAAGCCCCACAGATTGCCTTA^ 453 

II I INI Ml I II MUM Mill I III I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MUM 

GO GGATATGAAGCCCCACAGATTC 119 

454 CATGAACCACTTGCCAAAATCATCCTCTTTTCTAATCAATTCAGAGATTTCT 513 

I | |! I i I II I M I I I I I I I M h I I h I I I I I II I I M I I I I I II I I I I I I M I M I M 
,20 CATG^CCACTTGCCAAAATCATCCT 179 

314 GTGGAGTTGTCAACATTTGATATTGCTTCAGATGCCTTT "3 

ii i mm 1 1 imi 1 1 inn mi iiiiMiiiiii m i 1 1 h nun ii in 1 „ a 

130 GTGGAGTTGTCAACATTTGA 239 
„, ACCAGACATAAAGTGTTGGTAGCAGACTTCTTAGAACAAAATTACGACACTATTTTTGAA 633 

MMIMMltllll III II HIM MM I HIM II IMI I III HIM I M Mill I 

340 ACCAGACATaIJvGTGGTGGTA^ 2 "' 
634 GACTATGAGAAATTGCTTCAGTCTGAGAATTATGTTACTAAGAGA^ 6*3 

i I ii 1 1 1 ; 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 r 1 1 1 1 1 1 i I III II MUM I II III IMMM I 

300 gactatgagaaattgct 359 
6 -4 ctacgggagctgatcctggaccgtcacaactttgccatcatgacaaagtatatcagcaag 7 5 3 



419 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 
Qy 
Db 



360 CTAGGGGAGCTGATCCTGGACCGTCACAACTTTGCCATCATGACAAAGTATATCAGCAAG 

jAAACTCATGATGAACCTCCTTCGGGATAAAAGTCCCAACATCCAGTTT 813 



754 CCGGAGAACCTG. 



in mi in nijiiinniMJIii'iliilii'i''., 



479 



420 CCGGAGAACCTGAAACTCATGATGAACCTCCTTCGGGATAAAAGTCCCAACATCCAGTTT 
814 GAAGCCTTTCATGTTTTTAAGGTGTTTGTGGCCAGTCCTCACAAAACACAG^ 873 
30 GAAGCCTTTCATGGTTTT^GGGGGT^ 539 



4 8 



933 



874 GAGATCCTGTTAAAAAATCAGCCCAAACTCATTGAGTTTCT 

i Ml II Ml 1 1 II I II II I Ml MM II ■ M 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 M 1 1 1 1 1 1 

540 GAGATCCTGGT 5 " 
.,34 AGG - - ACGGATGATGAGCAGTTCGCTGACGAGAAGAACTACTrG ATTAAAC 982 



0 0 AGGGACGGGATGATGAGC 



650 



RESULT 13 

BU518807 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 

/•ords 

':.'.{■< -WHISK 



AUTHORS 
TITLE 
: JOURNAL 
.£^0«BNT 



source 



3U51880 7 934 bp mRNA linear EST 12-SEP-2002 

AGENCOURT_10171930 NIH_MGC_134 Mus musculus cDNA clone 

IMAGE: 6516567 5', mRNA sequence. 

HU51SS07 ■ 

3TJ513807.1 01:22826333 

;.;ST. 

. us musculus (house mouse) ' -"° ; . 

-iLryota;^ Chordata; Craniata; Vertebrata; Euteleostomi ; 

mammalia? Eutheria; Rodent ia; Sciurognathl ; Huridae; Murxnae; Mus. 
i {bases 1 to 934) ? ' . 

•UH-MGC http://mgc.nci.nih. gov/. ( . 

National Institutes of Health, Mammalian Gene Col.iec'cion (MGC) 

Unpublished _. f 
Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail-nih.gov , 
-issue Procurement: Dr. David Rowe 
cDNA Library Preparation: Invitrogen Corp 

-DNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
■ rionp distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/ LLNL at: 
http://image.llnl.gov 

Plate: LLAM14095 row: e column: 16 
High quality sequence stop: 556. 
Location/Qualifiers 
1. .934 

/organism="Mus musculus" 
/mol_type="mRNA" 
/db_xref="taxon: 10090" 
/clone=" IMAGE: 6516567" 
/tissue_type= "undifferentiated limb" 
/lab_host="DH10B (phage-resistant) " 
/clone lib="NIH_MGC_134" 

/note= "Vector: pCMV-SP0RT6 . 1 . ccdb; Site_l: EcoRV; Site_2 . 
NotI; Cloned unidirectionally - Primer: Oligo dT. Average 



Db 

Qy 



Db 



LiD, 

Qy 

Db 

C-y 



Qy 

Db 



insert size 1.7 kb . Constructed by ResGen, Invitrogen 

Corp. Note: this is a NIH_MGC Library." 

BASE COUNT 301 a 198 c 200 g 234 t 1 others 

ORIGIN 

Query Match 57.7%; Score 585. 2; DB 13; Length 934; 

Best Local Similarity 89.0%; Pred . No. 2.3e-116; 

Matches 654; Conservative 0; Mismatches 79; Indels 2; Gaps 2; 
n „ 9 TGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATCC 61 

Qy 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MINIMUM II IN INN INI Mill I 

Eb 113 TGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTCAAAATTC 172 

S2 TGAAAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAAG 121 

MINIUM' 1 1 1 1! 1 1 1 M M 1 1 I ! I M M M i 1 1 1 1 1 M II 1 1 1 1 1 1 II 1 1 

Db 173 TGAAAGACAACCTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAAG 2J2 

rtv 122 AAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAAC 181 

y I Mill I Mil M llllll II I Ml II II MM II II II II Mill Mil I 

233 AGGTGTCAAAATCTCTGCAAGCAATGAAGGAAATTCTGTGTGGAACGAACGACAAGGAGC 2 92 
182 CCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTGA 241 

MM III III II Ml IN II MM II II II I Ml III III I II MM MM 

2 9 3 CCCCTACAGAAGCAGTGGCTCAGCTGGCGCAGGAGCTCTACAGCAGCGGGTTGCTGGTGA 3 52 
■ \" < ACTGATAGCTGACtJTGCAGCTGATAGAC'TTTGAGGGAAAAAAAGATGTGACCCAGATAT 3 01 

, ' Mil ! I i| II M I M II II 1 1 II HI I i I II 1 1 II I li II II II 1 1 1 i 1 1 1 i 1 1 II 1 1 

•. 5? . CA CTCATAGCTGACCTGCAGCTCATAGACTTTGAGGGAAAAAAAGATGTGACC.CAGATAT 412' 
: • •■ " i • • CTAACAACATCTTGAGAAGAC AGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGTG 3 6 K 

| MM I MM I II I II I II M II i I 1 1 Ml II Ml III 1 1 Mill Mill 

,■,"■;! :; Tc:AACAACATCeTGAGAAGACAGATTGGTACACGGTGTCCTACTGTCGAGTACATCAGTT 475. 
» ••TCATCCTCATATCr-TGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTAC 1 21 : 

'•' ' Ml MIMI I M M M I II I M M Ml Mill II II I Ml II I M I M I MM I Mi = 

tf?a- CTCATCCTCACATQCTGTTTATGCTTCTCAAAGGCTATGAAGCCCCACAGATTGCCTTAC 532"' 
~>v '"2 GTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTCT 4 81. 

i M MINIMUM Mill II NIMH Mill III III IN III I MINI I I 

:i?3 GCTGTGGGATTATGCTAAGAGAGTGTATTCGACATGAGCCACTTGCCAAAATCATCCTAT b9.2 
i32 XTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCTT 541 

V liiiiMIII INN INN II II IN II M IN MM II MINN I MM 

5.93 TTTCTAATCAGTTCAGAGATTTCTTCAAGTATGTTGAGC1GTCCACCTTTGATATCGCTT 652 
•S d 2 CAGATGPCTTTGCTAC TTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGACT 601 

' || II M 1 1 1 1 1 1 1 1 1 M I 1 1 1 II I II 1 1 1 1 1 1 1 1 1 1 II 1 1 i I llllll II 

653 CAGATGCCTTCGCTACTTTTAAGGATTTGTTAACCAGACATAAAGTATTGGTAGCAGACT 712 



. 0 TCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAGA 661 

' ■ " 1 1 HIM Mill Nil Ml Mill II III INI III M INI II II II I 

7 i 3 TCTTAGAACAAAATTATGACACTATTTTTGAAGACTATGAGAAACTGCTGNCATCTGAGA 772 
662 ATTATGTTACTAAGAG - ACAGTCTTTAAAGCTGCTAGGG - GAGCTGATCCTGGACCGTCA 719 

I 1 1 1| I II HIM III - 1 I I I I I I l-l MINIM I Ml II I III I I I 

773 ACTATGTGACAAAGAGAACATTCTTTAAAGTTGCTAGGGTGAGCTGATCCCTGGACCGCC 832 



Qy 72 0 CAACTTTGCCATCAT 734 

II II II III 

8 33 CACAATTTTCACCAT 84 7 



RESULT 14 
CD354831 
LOCUS, 
DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



University of Iowa 

Bento Soares, University of Iowa 
Bento Scares, University of Iowa 



CD354831 713 bp mRNA linear EST 29-MAY-2003 

UI-M-GM0-cge-i-10-0-UI . rl NIH_BMAP_GM0 Mus musculus cDNA clone 
IMAGE: 30361641 5', mRNA sequence. 
CD354831 

CD354831 .1 GI : 3114 73 3 2 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Sukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 713) ■ 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: Dr. Jim Lin, 

cDNA Library preparation: Dr. M. 

cDNA Library Arrayed by: Dr. M. 

DNA Sequencing by: Dr. M. Bento Spares, University of Iowa 

Clone Distribution: Distribution information can be found at 
http : / /genome . uiowa . edu/distribution/mousef 1 . html 

This clone was contributed by the" Brain Molecular Anatomy Project 
(BMAP) 

Seq. primer: pYX-5." 

Location/Qualifiers ' :< • • 

- • 1. .713 : - 

/organism="Mus musculus" 
/mol_type= "mRNA" 
/strain="C57BL/6" 
/db_xref="taxon: 10090" 
/clone- "IMAGE: 30361641" 
/tissue_type= "whole brain" 
/dev_stage="l, 5 and 15 days newborn" 
/lab_host="DH10B (Tl phage resistant) " 
'/clone_lib= " N I H_BMA? _GM0 » 

/note= "Organ: Brain; Vector: pYX- Asc; Site_l: EcoR I; 
Site 2 : Not I; The library was constructed according 
BonaTdo, Lennon and Soares, Genome Research, 6:791-806, 
1996. Denatured RNA was size fractionated on a 1% agarose 
gel. First strand cDNA synthesis was primed with oligo-dT 
primer containing a Not I site. Double strand cDNA was size 
selected according to mRNA size fraction, ligated with EcoR 
I adaptor, digested with 'Not I and then cloned 
directionally into pYX-Asc vector. The library tag 
sequence located between the Not I site and the polyA tail 
is CGAACTGAAT. This library was created for the University 
Iowa Brain Anatomy Project (BMAP) : 'Gene Discovery in the 
Developing Mouse Nervous System', supported by National 
Institute of Mental Health (NIMH), Hemin Chin, Ph.D., 



BASE COUNT 
ORIGIN 



program coordinator." 
220 a 159 c 140 g 192 t 



2 others 



Query Match 57.1%; Score 579.2; DB 14; Length 713; 

Best Local Similarity 89.9%; Pred. No. 4.4e-115; 

Matches 6*2; Conservative 0; Mismatches 70; Indels 2; Gaps 2; 

AAAAGATGTGACCCAGATATTTAACAACAT - CTTGAGAAGACAGATAGGCACTCGGAGTC 34 0 

IIIIIIIIIIIMMIIIIII MINIM I Mill MM! M II III III 

AAAAGATGTGACCCAGATATTCAACAAtATCCNTGAGAAGACAGATTGGTACACGGTGTC 6 0 
CTACTGTGGAGTATATTAGTGCTCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATG 4 00 

Ml II 1 1 Mill II III II III I Mil II Mill MMIM MM I 1 1 II 

CTACTGTCG AGTACATCAGTTCTCATCCTCACATCCTGTTTATGCTTCTCAAAGGCTATG 12 0 
AAGCCCCACAGATTGCCTTACGTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAAC 460 

Ml Ml II IMMIMMMM lllll Ml III Ml Mill IMIMI II Mill I 

AAGCCCCACAGATTGCCTTACGCTGTGGGATTATGCTAAGAGAGTGTATTCGACATGAGC 1 8 0 
CACTTGCCAAAATCATCCTCTTTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGT 52 0 

1 1| Mill IMMIMMMM Mill Ml II MM II Mill I Mill II Ml 

CACTTGCCAAAATCATCCTATTTTCTAATCAGTTCAGAGATTTCTTCAAGTATGTTGAGC 24 0 
TGTCAACATTTGATATTGCTTCAGATGCCTTTGCTACTTTCAAGGATTTACTAACCAGAC 5 80, 

MM II M I! II II IMMIMMMM 1 1 1 II I M IMIIMI IIIMIMI 



Qy 


282 


Db' 


1 


Qy 


341 


Db 


5 1 


Qy ' 


4 01 


Db 


121 


«y 


-io-i 


lib 


131 


'■:¥ 


ij : ; i , 




>v-i 














Db 




Qy 


701 


Db 


421 


Qy 


761 


Db 


"4 81 


Qy 


-3,21 


Db 


541 


Qy 


881 


Db 


601 


Qy 


941 


Db 


.. 661 



! MM IT M M 1 1 1 II 1 1 1 M 1 1 1 M 1 1 1 1 1 M II I MM M 1 1 II 1 1 1 1 1 II 1 1 II I 



AGAAATTGCT'^CAGTCTGAGAATTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGG 7 0 0 ;■ 

IMM llllll I MMIM Mill M IMIIMI IIIMIMI IMIMI I 

AGAAAC TGC TGC AAT C TGAG AJA CTATGTGAC A AAGAG AC AATC TTTAAAGTTGC TAGGTG 4 2 Or- 



IMIIMI I MMIM Mill II MM lllll 1 1 II 1 1 1 M 1 1 M 1 1 II II M 



ACCTGAAACTCATGATGAACCTCCTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGCCT 82 0 

I MMIM II MM II MM I Mill M Ml Mill IIIMIMI M MMIM 

ACCTGA AACTGATGATGAACCTGCTTCGAGAC AAAAGTCCCAACATCCAATTCGAAGCCT 54 0 
TTCATGTTTTTAAGGTGTTTGTGGCCAGTCCTCACAAAACACAGCCTATTGTGGAGATCC 8 80 

I Mill IIIIMI I IIIIMI Mill II III Mill IMIIMI IMIIMI I 

TCCATGTCTTTAAGGTGTTTGTGGCCAGCCCCCACAAAACGCAGCCTATCGTGGAGATTC 5 0 0 
TGTTAAAAAATCAGCCCAAACTCATTGAGTTTCTGAGCAGCTTCCAAAAAGAAAGGACGG 94 0 

I IMIMI II Ml II II MM II II MM MUM Mill III II MM MUM I I 

TGTTAAAAAATCAGCCCAAACTCATTGAGTTTCTGAGCAGCTTTCAGAAAGAAAGGACAG 660 
ATGATGAGCAGTTCGCTGACGAGAAGAACTACTTGATTAAACAGATCCGAGACT 994 

I || MM MM IIIMIMI III lllll llllll MUM IMIMI 

ACGACGAGCAGTTTGCTGACGAGAAG - ACTACCTG ATTANACAGATTCGAGACT 713 



RESULT 15 
HSM073180 

ID HSM073180 standard; RNA; EST; 742 BP. 
XX 

AC BX4 8 3 012; 
XX 

SV BX483012.1 
XX 

DT 09-MAY-2003 (Rel . 75, Created) 

DT 09-MAY-2003 (Rel. 75, Last updated, Version 1) 
XX 

DE Homo sapiens mRNA; EST DKFZp68 bC08234_rl (from clone DKFZp686C08234 ) 

XX 

RW EST; expressed sequence tag. 

XX 

OS Homo sapiens (human) 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; Mammalia 

OC Eutheria; Primates; Catarrhini; Hominidae; Homo. 

XX 

RN [1] 

IIP 1-742 

RA Ottenwaelder B., Obermaier B., Deutschenbaur S., Mewes H.W., Weil B., 

RA _Amid C. , Osariger A. , Fobo G. , Han M . , Wiemann 3 . ; 

RT , • • 

RL Submitted (07-MAY-2003 ) to the EMBL/ GenBank/DDBJ databases. 

ftL MIPS, In/jolstaedter Landstr.l, D-85764 Neuherberg, GERMANY - . V. 

C':' This is rhe 5' sequence of the clone insert V 

OC Clone from S. Wiemann, Molecular Genome Analysis, German Cancer" 

CC • Research Center (DKFZ) ; Email s.wiemann@dkfz-he.idelberg.de; 

CC sequenced by MediGenomix (Mart irisried/Germany) within the cDNA 

CC sequencing consortium of the German Genome Project. • 

CC- No .sal sequence available. 

CC Tiiis clone (DKFZp686C08234 ) is available at the RZPD in Berlin: 

CC Please contact the RZPD: Ressourcenzentrum, Heubnerweg 5, 

CC 14059 Beriin-Charlottenburg, GERMANY; Email: clone@rzpd.de 

XX 

FH Key Locat ion/Qualif iers . \% 

TH - 

FT source 1. .74 2 

FT /db_xref ="taxon: 9606" 

FT. /mol_type-"mRNA" 

PI /organism="Homo sapiens" 

FT /clone="DKFZp686C0B234" 

FT /clohe_lib="686 (synonym- hlcc3) . Vector pSportl_Sfi; host 

FT DH10B; sites SfilA + SfixB" 

FT /dev_stage= "adult" 

FT- /tissue_type="cDNA-collection" 

XX 

SQ Sequence 742 BP; 256 A; 143 C; 162 G; 179 T; 2 other; 

Query Match 57.0%; Score 578; DB 2; Length 742; 
Best Loe>l Similarity 99.8%; Pred. No. 8e-115; 

Matches 578; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy i ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATC 60 

MIIMIIIIIIMIIIIIMIIIIIIIilllllllllilllllllllllMIIIIIIII 



164 ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATC 223 
61 CTGAAAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAA 120 
224 ScilUUS^ 

121 OAAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAA 180 

344 i!;CCCCAACAGAAGCAGTGGCTC 403 
o , x ACACTGATAGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAA^ 3 00 

464 YTTAACAACATCTTGAG 5 " 
, .... <~r^riiTrCTCATATCCTGTTTATGCTCCTCAAAdGATATGAi^GCCCCACAGATTGCCTTA 420 

- m n ii Tm MM 1 1 II 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 IN M 1 1 1 1 Ml ., 

,;,4 (iCTCATCCTCATATC^ ^ 
- VTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAA^^ 480, 

• V^muTTT iTlllillliMIMlMIIIIIMIIIII IIIIMIIIIIMilMI 

, . CGTTGTGGGATTATGf !^ 
, », ^TCTAATCAATTCAGA^ 54 °' 

704 TCAGATGCCTTTGCT 742 

completed: January 5, 2004, 03:18:09 
us : 2589 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen L 



OM nucleic -nucleic search, using sw model 

January 6 2004, 00:37:47 ; Search time 3965 Seconds. 
on: January b ( (without alignments) 

10462.134 Million cell updates/sec 

Title: QS-10-088-872-1 

Perfect scor- : 1014 taaaqaaaacggccccttga 1014 

Senuence: 1 atgaaaaaaatgcctttgtt cgaayddd yy 

Sco^ino table: IDENT IT Y_NUC 

Gapop 10.0 , Gapext 1.0 

•Searched: 2888711 seqs, 20454813386 residues 

Total number of hits satisfying chosen parameters: 5777422 

Nj-i-'iniuui OB seq length: 0 

Maximum DB esq length: 2000000000 

" Vost -presto. : ..:.ig: ' Minimum Match 0% 

. ' V Maximum Match 100% 

[listing first 45 summaries 



• base 



GenEmbl : * 
;>. : gb_ba : * 
2: gb_htg:* 
3 : gb _in : * 

4 : 'jb_pm : * 

5 : gb_oy: * 

6 : gb J>at : * 

7 : gbj?h : * 

8 : gb_J>l : * 
9 : gb_pr : * 
10: gbjro:* 
11: gb_sts:* 
12: gb_sy:* 
13: gb_un:* 
14 : gb_vi : * 
15: em_ba : + 
16: em_fun:* 
17: emjhum:* 
18: em_in:* 
1 9 : em_mu : * 
2 0 : em_om : * 
21: em_or:* 
22: emov:* 

2 3 : em_pat : * 

24: em__ph:* 

25: em_pl:* 

2 6 : em_ro : * 

27 : em StS : * 



28 : 


em_un : * 


29 : 


em vi : * 


30 : 


em_htg_hum : * 


31: 


em_htg_inv : * 


32 : 


em_htg_other : * 


33 : 


em_htg_mu3 : * 


34: 


em_htg_pln : * 


35: 


em htg_rod:* 


36 : 


em_htg_mam : * 


37 : 


em_htg_vrt : * 


38: 


em_sy : * 


39: 


em_htgo_hum : * 


40 : 


em_htgo_mus : * 


41: 


em htgo_other : * 



D^d- No is the number of results predicted by chance to have a . 
si greater than or equal to the score of there suit being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result Query 

IZo. Score Match Length I)B 



ID 





1014 


100 . 


0 


1014 


6 


AX105381 




1010.8 


99. 


/ 


1344 


6 


AR0 97 3 61 




loic . e 


99. 


7 * 


1344 


•6 


AR203365 




1009.2 


■ 99. 


5 


1491 


9. 


BC010 9 93 ... 




992 . 8 


97 . 


9 


2002 


: . > 


BD157871 


6 


992 . 3 


97. 


9 


2002 


9 


AK022639 


7 




84 . 


9 


13 5 9 


10 


BC016128 


8 


35 3 . 8 


34 . 


7 


1530 


10 


BC016 54 6 




7 70.6 


76 . 


0 


82 2 


6 


BD147463 


10 


•_;C4 . 5 


67 . 


5 


831 


6 


BD079551 


11 


•:::=32 . 6 


57 . 


5 


102 6 


6 


AX061831 


12 


$82 .6 


57 . 


5 


3281 


6 


AX082322 


13 


582 . 6 


57 . 


5 


3761 


9 


3C020570 


14 


531 . 6 


57 . 


4 


1680 


9 


AF151824 


■! r 


5 81 . 6 


57 . 


4 


3466 


9 


AF113536 


16 


568 . 8 


56 


1 


1947 


10 


BC020041 


17 


567.2 


55 


9 


2322 


10 


S51858 


18 


552 . 8 


54 


5 


3346 


9 


AK000804 


19 


541 .6 


53 


4 


1053 


6 


AX105727 


20 


533.6 


. 52 


.6 


851 


9 


AY211923 


21 


520 . 2 


51 


. 3 


333 


6 


BD079552 


.22 


47 8. 8 


47 


.2 


2366 


5 


BC044172 


2 3 


363 . 3 


36 


.4 


2991 


10 


BC029053 


24 


365 


36 


.0 


158599 


2 


AC101941 


25 


362.8 


35 


. 8 


1894 


3 


AY128438 


2 6 


362.8 


35 


. 8 


8324 


2 


AC019926 


27 


362 . 8 


35 


. 8 


201313 


3 


AC010688 


28 


362 . 8 


3 5 


. 8 


281101 


3 


AE003526 


29 


361 .2 


35 


.6 


2213 


3 


AB000402 


30 


270 


26 


.6 


1452 


9 


AK026335 


31 


255 


25 


.1 


1466 


8 


CPR2 386 32 


32 


240 . 8 


23 


.7 


3514 


10 BC034159 


33 


210 .4 


20 


.7 


1474 


8 


AY088359 



Description- 

AX105381 Sequence 
AR0 97 3 61 Sequence 
AR203365 Sequence 
BC010993 Homo sapL 
BD157871 Primer fo 
AK02 2 63 9 Homo sapi 
BC016123 Mus'museu 
BC016546 Mus museu 
BD147463 Primer fo' 
BD079551 Cancer-as- 
AX061831 Sequence^- 
AX082322 Sequence- 
BC020570 Homo sapi 
AF151824 Homo sapi 
AF113536 Homo sapi 
BC020041 Mus muscu 
S51858 M025. gene [ 
AK000804 Homo sapi 
AX105727 Sequence 
AY21192 3 Homo sapi 
BD079552 Cancer-as 
3C044172 Danio rer 

BC029053 Mus muscu 
AC101941 Mus muscu 
AY128438 Drosophil 
AC019926 Drosophil 
AC010688 Drosophil 
AE003526 Drosophil 
AB000402 Drosophil 
AK0263 3 5 Homo sapi 
AJ238632 Chlorella 

BC034159 Mus muscu 
AY0 8 83 5 9 Arabidops 



H 

If' 



t 



c 
c 
c 
c 
c 
c 
c 



34 
35 
36 
37 

3 8 
39 
40 
41 
42 
43 
44 

4 5 



209 . 6 
200.2 
200 .2 
195 
195 
186 .4 
185-4 
185 .4 
185 .4 
172.8 
16 9.8 
166.6 



20.7 
19.7 
19.7 
19-2 
19.2 
18 .4 
18.3 



1570 
1032 
1519 
387 
387 
89137 
89807 



18.3 349980 

18.3 349930 
17.0 159863 
16.7 722 

16.4 700 



8 

8 

8 

6 

6 

8 

9 

6 

6 

9 

6 

6 



AF428296 

AY055792 

AF380659 

AX333971 

AX407834 

AC006284 

AL138875 

AX711879 

AX739961 

AL136218 

AX283846 

AX283847 



AF42 82 96 Arabidops 
AY055792 Arabidops 
AF380659 Arabidops 
AX333971 Sequence 
AX407 834 Sequence 
AC0 062 84 Arabidops 
AL13 8 87 5 Human DNA 
AX711879 Sequence 
AX739961 Sequence 
AL13 6218 Human DNA 
AX2 83 846 Sequence 
AX283347 Sequence 



ALIGNMENTS 



RESULT 1 
AX1053 81 
LCCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 

r;o:;?.cE 

O^C?NTSM 



AX105381 



1014 bp 



DNA 



linear 



PAT 30-APR-2001 



Sequence 1 from Patent WO0123552. 
AX105381 

AX105381.1 GI:13921508 

Homo sapiens (human) 
Liomo sapiens 

Sukaryota; Metazoa; Chordata; Cranrata; 
Mammalia; Eutheria; Primates; Catarrhim 



Vertebrata; Euteleostomi ; 
Hominidae; Homo. 



Y'l^T-lRENCE 
AUTHORS 
T3:r'LE 

OOURiSTAL 
source 



CDG 



binding protein 

-Patent; WO 0123552-A 1 05-APR-2001; 

MERCK PATENT GmbH (DE) ; ... 

Location/Qualifiers 

1. .1014 

/organism- "Homo sapiens" 
/ mo l_type=" genomic DNA" 
/db_xref ="taxon: 960 6" 

1. .1014 • 

/note= "unnamed protein product" 

/codon start=l : 
/protein _id= "CAC3 773 5 . 1 " 

/traS^ 
SLQ^KEILCOT 

MTTRRO T GTR Q PTVE YI S AHPHI LFMLLKGYE APQ I ALRCG IMLREC IRHEPLAKI I L 

fsnqfSf^ 

S ENYVTKRQSLKLLGEL ILDRHNFAIMTKY I S KPENLKLMMNLLRDKS ^^^^ 



BAS- 
QRH 



<] COUNT 

;in 



340 



TAP" 
a 205 



209 g 



260 t 



Query Watch 
Best Local Similarity 
Matches 1014; Conservative 0; 



100 . 0%; 
100 . 0%; 



Score 1014; DB 6; Length 1014; 
Pred. No. 6.9e-238; 

Mismatches 0; Indels 0; 



Gaps 



0; 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



Qy 

Db 



; L ATGAAAAAAATGCCTTTGTTTAGTAAATCAC^ ™ 
S1 C^AAAGACAATTTGGCCATTTTGGAAAAGC^ 

6i UgI^i^^ 120 

■« n n-n T\ ion 



121 GAAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGT^ "0 



Qy 

Qy 

Db 

QY 
Db 

Qy 

Db 

Qy 

Db ' 

Qy 

Db 



iP , CCCCCAACAGAAGCAGTGGCTCAGC^^ "° 

1B1 240 

241 ACACTGATAGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAA^ ^00 

301 TTTAA.C AAC ATCTTGAGAAGAC AG AT AGGCAC *y ^j*'??^*^'^" '^'^'^j' '^"^'y^^'^^'l^'^'^'^'T'^'^'^l'T ^ 
M iiiiicAA^^ 

,, : iCTCATCCTCATATCCTGTTTATGCTC^ 

CGTTGTGGX3ATTATGCTGAGAGAATGT 4 80" 
.^UgTTG^ 

1TTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTOT ^ 

481 TTTTCTAATCAATTCAGAGATTTC ^ 

541 TC AGATGCCTTTGCTACT'tTCAAGGATTTACTAACCAGACATAAA 600'. 

S41 i'CAGATGCCTTTGCTACTTT^ 600 

sel TTCTTAGAACAAAATTACGACACTATTTT^ 660 

60i TTCTTAGAACAAAATTACGACACT^ 660 

SGi AATTATGTTACTAAGAGACAGTCTTTAAAGCTC 720= 

720 

7 „ AACTTTGCCATCATGACAAAGTATATCAGCAAGCCGGAGAACCTGA 780. 

7 ,, jvACTTTGCCATCATG 780 

7 31 CTCCTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGCOTTTCATGTT^ B40 

, el yiy^ 840 



841 GTC3C3 C C A.C5T C C T G A.C A.C^VC3C C'r-A-'T'TG TGG.^Cj.AT C CTCjT *I7 C C C-^A-A. 900 
841 i^i^ 

001 CTCATTGAGTTTCTGAGCAGCTTCCAAAAAGAAAGGACGGATG^ 960 



Qy 



961 GAGAAGAACTACTTGATTAAACAGATCCGAGAOT 

1 1 1 1 ii 1 1 1 1 1 uiiimimiiiMMiM^ 



1014 



Db 961 



ILgIIgIL^ 1014 



RESULT 2 

ARC97361 DNA linear FAT 14-FEB-2001 

LOGUS AR097361 F 

DEFINITION Sequence 2 from patent US 6071721. 

ACCESSION AR097361 

VERSION AR097361.1 01:12806091. 
KEYWORDS 1 

SOURCE Unknown . 

ORGANISM Unknown. 

Unclassified. 

REFERENCE 1 (bases 1 to 1344) ~ orlev H c a nd Gorgone,G.A. 

AUTHORS Tang, Y. Tom., Guegler , K.u - , v-orley,N.o. an y 

•v-YTLE calcium binding protein 
'< M'PNAL Patent: US 6071721-A 2 .06- JUN -2000 ; 
v , . V , URKS Location/Qualif iers 

1 1344 

source 1 ♦ ■ 

/organism- "unknown" 

BASE COUNT 450 a ~261 c 280 g 353 t 

ORIGIN 

■ . v. 9 9 7%; Score 1010.8; DB 6; Length 1344; 

Querv Match ' „ ^ M ^ * 9^-937. 

Best- Local Similarity 99.8%; ^ ^ ^2e 237 0; Gaps .. 0; 

Matches- 1012; Conservative 0; Mismatches 

1 ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCC *° 

61 CTGAAAdACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAA(MCTTCAGAA 120 
-91 GAAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA 180 
18 1 CCCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCT 240 



Qy 

Db 

Qy 

Db 

Qy 

Db' 

Qy 



Qy 

Db '■ 



QY 
Db 

QY 
Db 

QY 
Db 

QY 
Db 
Qy 

Db 
QY 



Db 

Qy 
Db 



QY 

Db 

QY 
Db 



301 

4 2 A. 
361 
48 4 
421 
544 
4 81 
6-G4. 
541 

601 
724 

£01 



TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGA^ 

GCTCATCCTCATATCCTGTTTATGCTGCTCAAAGGATATG^ 

CGTTGTGGGATTATGCTGAGAGAATGTATTCGACA^ 

TTTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTC 

TCAGATGCCTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTC 
IUgUgG^ 

TTCTTAGAACAAAATTACGACACTATTTTTGAAGACT 

AATTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTC 
: i I I I I I I I I II I I I I I I I I I I ! I M I I I I I 1 1 LLLl[LL^lrnr*nvn-n*rraTCAC 



1 1 1 11 ' 1 1 ! ! ! ! I I M ! I I I [III'IIIa^U^^ 




731 
^04 
841 
964 
901 

1024 
951 

1084 



LlU-ii^^ 1 ^' 

GTGGCCAGTCCTCACAAAACACAGCGTATTGTGGAGATCCTC 

QTGGCCAGTCCTCACAAAACACAGC^ 
CTCATTGAGTTTCTGAGCAGCTTCCAAAAAGAAAGGAC^ 

GAGAAGAACTACTTGATTAAACAGATCCGAGACTTGAAG "14 

iiUiAUc^^ 1137 



360 
4 83 , 
420 
543 
480 
603. 
540 
66.3 : . 
600 
723 
660 
783 
720- 
34 3-^ '! 
780 , 
90 3, 
840 
963 
900 
1C23 
960 
1083 



RESULT 3 

AR203365 l344 bp £ W a 

SSmo. 2 fr- patent US „.»71. 

KEYWORDS 

SOURCE Unknown . 



linear PAT 20-JUN-200-2 



ORGANISM Unknown. 

Unclassified. 
REFERENCE I (bases 1 to 1344} 



Qy 

Db 



1FERENCE 1 (bases 1 to 1344) C orley,N.C. and Gorgone,G.A. 

AUTHORS Tang, Y.Tom., Guegler , K. J - , Cori y. 

TITLE Calcium binding protein 
T x. ±_ ,to ctp;r^71-A 2 02-APR-2002, 

JOURNAL Patent: US 636 537 ^ * 
FEATURES Locatxon/Qualifiers 

SOUrCe / organism=" unknown » 

BASE COUNT 450a 261c 280 g 353 t 

ORIGIN 

99 7%; Score 1010.8; DB 6; Length 1344; 
Query Match N 4.2e-237; 

, 0I ™™™™ 360 

... 4 USUiiM 483 ' 
... 34 SSK^^ 



Db 

Qy 
Qy 

"ijfe 

QY 
Db 



Dr. 

Qy 
Qy 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



72 4 

661 

7 84 

721 

844. 

781 
904 
341 
964 
301 

1024 
961 
I OS'!- 



-TGGCCAGTCCTCACAAAACACAGCC^^ 



V.y j. x_i ^ ~ • - 

^TCATTGAGTTTCTGAGCAGCTTCCAA^ 

■^AC^AGAACTACTTGATTAAACAGATCCGAGACTTGAAGAAAACGGCCCCTTGA 1014 



783 
720 ■ 
843 
780. 
903 , 
840 
963 
900 
102 3 
960 
10.83. 



3 ..')109>>J 
LOCUS ' - 
DEFINITION 

ACCESSION 
VERSION 

KEYWORDS 

SOURCE 

ORGAXTISM 



reference 

authors 

TITLE 
JOURNAL,: 



REMARK 
COMMENT 



1491 bp mRNA linear PRX 25-JUL-200V 
Homo sapiens, hypothetical protein .LJ12577, r l 
IMAGE: 3956127, mRNA, complete cds . 
BC010993 

3C010993.X GI :15012172 . , ; . 

MGC. 

Homo sapiens (human) 

Homo sapiens vertebrata; Euteleostomi ; 

S'S; — — 

I (bases 1 to 1491) ^ . 

Strausberg,R- 

Direct Submission q of Health, Mammalian 

Submitted (23-JUL-2001) National ^st.tutes of He , 
Gene Collection (MGC) , Cancer Genomics °^e ftro o< 
Institute, 31 Center Drive, Room 11A03, Bethesaa, 

NIH-MGC Project URL: http://mgc.nci.nih.gov 
Contact: MGC help desk 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: ATCC 

bi J . . / inqfitute for Systems Biology 
DNA Sequencing by: Institute lul y 

h*" tp : / /www . sys temsbiology - org 



G . eene , Mark Ketteman and Anuradha Madan 

passed the following selection cnteri 
FEATURES " Location/Qualxf xers 

sourCS /organism="Homd sapiens" 

/mol_type="mRNA" 
/ db_xr e f = " Locus ID : 8 1 6 1 ) 
/db xref="taxon:9606" 
/clone="MGC: 15031 IMAGE : 395ol27 
/tissue tyP e="Placenta, choriocarcinoma 
/ c lone_l ib="N IH_ MGC_2 1 " 
/lab _host=="DH10B-R" 
/note-" Vector: pOTB7» 
CDn 416. .1235 

/codon start=l IT1 ,r 17 » 

/products-hypothetical protexn .W12577 

/protein_id="AAH10993-l" 

/db_xref = "GI : 15012173 « ^ pppTEAVAQLAQ ELYSSGLLVThTADLQLIDFE 
/ translation= '^^^gp^gy j^aHPHIIjFMIiLKGYEAPQIAIjRCGIMLRECI 

jJIKOIRDLKKTAP" : .. ■ .mT 

., > ,., ; roi ... 503 a "290 c 305 g 393 t ,. ; 

0 ^ U ' 1 ' . . 99 . 5 %; Score 1009.2; DB 9; Length 1491; 

r^Tf„r - Mismatches - — ~ 

y ... : m,: 
* " „ 120 

iO'L TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGT 



Db- 
QY 

Qy 

Db 

Qy 

Db 

Qy 



Db 

Qy 

Db 

QY 
Db 

QY 
Db 

Qy 

Dfo 

Qy 

Db 

Qy 

, Db 



Db 
Qy 

Db 



, - iSicclcicJJiiciciS 1171 

% 1014 ■ 



ssa™* sur«« - cdha - - 

ACCESSION BU157871 , 7S63629 
ORGANISM Home sapiens 



. v p^niata- Vertebrata; Euteleostomi ; 
Eukaryota; Metazoa; Chordata; t; Hominidae; H omo. 

Mammalia; Eutheria; Primates, Catarrhr 
REFERENCE 1 (bases 1 to J002) H ayashi,K., Saito, K. , Yamamoto, J 

AUTHORS Ota.T.. ^oga 1( T ^ 3h ^ ats ;, A ., Na gai,K. and Otsukx.T. 

Ishii.S., Sugiyama T. w *** fch cDNA an d use thereof 

TITLE P,i- for synthe S -- g ^J^oo, , 

■ JOURNAL gg^jx RESEARCH INSTITUTE 



PN 

PD 09-JUL-2002 



S — V— .SBISKO SUOXV^.,! — . 

" KF.IICHI KiGAI.TETSUJI OTSUKI 

■^ 1 ,,.,.C.™/.7.C OT =«/».C»»/».^'»- <M " /aliCia,B/,C 
prime* 2 . „ /niia l i f iers 



L E)J""" — 

Location/Qualifiers 

(127). -(993) 



Location/Qualifiers 
FEATUrtKS x 2002 

3 ° u7 -'- e / 0 rganism= "Homo sapiens" 

/mol type-" genomic DNA" 
/db xr«E- ,, taxon:9606" . 
t Q „ =' "418 '• 463 g 527 L 
BASE COUNT 594 a 418 - 

ORI«IW ' t .... •■ 

am «- DB 6; Lenqth 2002; 
97 9%; Score 952.8, o, a ^ 

vV^ hes $94; Conservative u, 

^.-a rri7vrnrprp7\ 7\ r*Z\ AP ATCTTGAGA 318 



Qy 

Db 

Qy 
Qy 

Db 

Qy 
Db 
Qy 



i cji GCTCAGCTAUi-A^*»-«™^ 



Db 

QY 
Db 

QY 
Db 

QY 
Db 

QY 
Db 

QY 
Db 

QY 



l:b 

QY 
Db 

Qy 

Db 

Qy 

Db 



301 AGACAOATACGGACTCGGAGTG C~0— TMTC CT« T CC~CTO 360 

suss «»■ 

, 99 BBBWW"™"^^ 

—nn mm » 

.... , SSS^ M * 

,« jJJ\CAGATCGGAGACTTGAAGAAAACGGCCCCTTGA 956 



RESULT 6 



AX022639 2002 bp mRNA linear PRI 01-AUG-2002 

LOCUS AK022639 FLJ12577 fis , clone NT2RM4001047 , highly similar 

DEFINITION Homo sapiens cDNA FLJ125// 

to M025 PROTEIN. 

A""-ESSI0N 2\K022639 

VERSION AK022639.1 GI:10434146 sequence ) . 

KEYWORDS oligo capping; fas full 

SOURCE Korao sapiens (human) 

ORGANISM Homo sapiens 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL- 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



CURES 



CDS 



--cffi ass zssszsssssr* 

Mammalia; Eutnerid, 

1 , . „ Qncivama.T. , Otsuki,T., Suzuki, Y., 

Nis hikawa,T., Nagax,K Sugano, S. k Kondo , H ., 

Wagatsuma M. Hosoxrx T^ Kaku^ Ishida , s ., Mu , a kawa,K., 

Sugawara,M., TaKanasm,n , xcimura K , Murakami, K., 

Ono.Y.. TaKiguchi.s »»»" abe ^ m ; m ™ a ; K „; te matsu,A., 

Nakamura , Y . , Naganarx, , 

NEDO human cDNA sequencing pro : ect 

Unpublished 

9 (bases 1 to 2002) 

Isogai,T. and Ctsuki,T. . 

Direct Submission Hp1 ix Research Institute, 

Submitted (23-AUG-2000, Takao Jsoga Helxx Res^ 

Genomics Laboratory; 1532-3 Yana * 39?5( Fax: 81-438-52-3986) 

(E-mail :genomics@hrx. co 3P, Tei ^^ guDDO rted by Ministry of 
• . EDO human cDNA sfequ^ncxng project supported / ; insert 
In ternational Trade ^^ndustry ot ^^ cDNA library 

sequencing: Research Association to d clone selection: 

construction, 5'- J 3 ■ -end | ey Technology Center 

Helix Research ^^J^J^ i n 2itu?e of Medical Science, 
etc.) and Department of Virology, -n 

university of Tokyo. - • • ' .' . 

Location/Qualifiers . 

• . . 2002 

/ oovani sm= " Homo s ap i ens " 
/mol_type="niRNA" 
/ db xre f = " t axon : 9 6 0 6 " 
/clone="NT2RM4001047" 
/cell_line="NT2» 
/cell type= " teratocarcmoma 

neuronal precursor cells . " 

/note= ;: unnamed protein product" 

/codon_start=l 
/protein_id="BAB14147.1" 

/db. t xref --GI : 104341 ^" ^ssgllvtlxadLQLIDFE 

/cranslation=»MKEILCGT^KEPPrEA V Q GYEApQIALRCGIMLRE CI 



BASE COUNT 
ORIGIN 



594 a 



LIKQIRDLKKTAP" 

418 c 463 g 527 t 



97 . Score 992 .3; DB 9; Length 2002; 

Querv Matcn OQ ' QSr Pred No. Lie -2 32; 

B«t Loci Similarity 99 8», 2; indels 0; Gaps 

Matches 93*; Conservative 



0; 



Qy 

Db 



95.5a- Conservative 



QY 
Db 

QY 
Db 

QY 
Db 

QY 
Db 

QY 

L Db. 

QY 



QY 



Dp 



DO 

QY 
Db ; 

QY 
Db 

QY 
Db 



61 

13 £ 

121 

199 

181 

2S3 
241 
319 
301 
37 9 
3 61 

:J9 

421 

-* . L 

3 3-D 



:i4l 
S19 
601 
67 9 
661 
739. 
721 
799 
7 01 



859 



138 
120 
198 
180 
258 
24.0 
318 
300 
378 
360 
438 . 
420 
4 9£ 

4 80 

5 58 : 
540:; ; 
6 18- 
60 0' 
67-8 
660 
738 
720 
798 
780 
858 
840 
918 
900 



841 



Db 9 C1 



Dr 96 : 



1014 



RESULT 7 
BC016128 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 



AltSChul # S.E\ 
Hopkins, R-F. , 
Oiatchenko, L . 
Stapleton,M. , 
ScheetZrfT.S.-i 
Carninci,P- , 
.Abramspn,R-i>- 



TITLE 

\JOURNAL 
• MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REMARK 
COMMENT 



D on 16-APR-2003 
1359 bp mRNA linear ROD 16 AM* 

— X»U -» — 

IMAGE:4911640) , complete cds . 
BC016128 

BC016128.1 GI: 16359341 

MGC . v 
Mus m usculus (house mouse) 

Mus musculus ,-rariiata; Vertebrata; Euteleostomi; 

-ssstjes ass Muritoi 

Klausner,R.D., Collins , F. S - , Wag Schae fer,CF. , Bhat.N.K. , 

- , Zeeberg,B., Buetow,* , Wan g,J., Hsieh.F., 

jordan.H. , «oore^ ; . " Rubin , 3 . M . , Hong,L. , 

. Marusma.K., Farmer, A ^ Casavaiitf T.L. , 
scares, M.B Bona^in T-i • Toshiyuki , S . , 

Prange , C . , Aaba , b _ ' M cEwan , P . J • . 

Ab ramson,R.D.. Mullah y.p. *-^££ p .h. , Richards, S.. 
McK.ernan,K.J., Malek, J- A , Gunar Hulyk.S.W., 
Wbrley.X.C. Hale, . - , ^rcxa^M. Y ; ^ Gi bbs,R.A. , 

Villalon,D.K., ^'^^M Madan.A., Rodrigues , S . , 
Fahey.J-, Heltpn,E. , A Young , A . C . , Shevchenko , Y . , 

3a nchez,A., Whiting.M.. M f * n ' \; uchman , j.w. , Green,E.D 
Bouffard,G.G., Blakesley^W - , J Schlnutz , J ., Myers, R.M.. 

bickson.M-C, Rodr ^ e ^ s ^' M ^, skaleka.U.. Smailus , D - E - , 
3u tterfield,Y.S., Krzywmskr M M arra,M.A- 

SSS Si" JlSii"^'- -e than 13,000 f ull-leng t h 

ST Si^i^ (2002) : 

22388257 
12477932 

2 (bases 1 to 1359) 

Strausberg, R- , 
Direct Submission , TTiqHtutes Q f Health, Mammalian 

USA • „- TTRL- http://mgc.nci.nih.gov 

NIH-MGC Project URL . nttp //a 
Contact: MGC help desk 

Email- cgapbs-r^ail^nrh^ov ^ 

Technologies, Inc. 



~a v™- The I M A.G.E. Consortium ( LLNL ) 
cDNA Library Arrayed oy. The l.u.a 

DNA Sequencing by. Genome Se ^»"^£ 
BC Cancer Agency, Vancouver , BC , Canada 

info@bcgsc.bc.ca yaron Butterf ield, 

Steven Jones, Jennif e« Bosdet, Yar « ^ ^ 

Susanna Chan, Readman Chru, Chris fjeii, Soo 
Letticia Hsiao, ^l^lZT^eZT. Candrce' McLeavy , Steven 
Sen Lee, Vlctor ^ ng .' ^"iisa Prabhu, Parvaneh Saeedi, Jacqueline 
Ness, Pawan Pandoh Anna-Liisa ™™' Lorraine Spence , Jeff Stott, 
Schein, Duane Smarlus Michael Smith Lor P v 

Michael Thorne, Miranada Tsai, Natas 3 a van aen 
George Yang, Scott Zuyderduyn, Marco Marra. 

•v • M pr ri-me distribution information can be found 
Clone distribution: MGC cone distrio //image. llnl .gov 

trough the I. M. A.G.E. Consortium/LLNL at . http.//imag 

Series = ^f ^^™ ^1 ^tn sequencing because it 
Sssed the foll^nfselection criteria: Similarity but not , 
identity to protein. 
^rrpc.i-S Location/Qualifiers . _ 

/organisms "Mus musculus" 

/mol type="mRNA" 

/strain="FVB/N" 

/db xref ="taxon: 10090" 

/^lnne-"MGC:28889 IMAGE : 4911640" 

/clone- ii^- zo ° ,„ p v 0 ,d female mouse" 

/tissue_type="Salivary gland, 10 wees o^a r - 

/clone_lib="NCI_CGAP_SG2 » 
/lab host-"DH10B" 

/note= "Vector : . pCMV- SP0RT6 » • .« ; 
1. .1359 

/oene="1500031K13Rik" 

/note = "synonyms: 4930520C08Rik, ..2810425O13Rxk- 
/db xref="LocusID: 69008" 
/db'~xref="MGI: 1916258" 

.,-,3 262. .1266 

/codon_start=l 

/product="150003lK13Rik protein" 
/protein_id="AAH1612 8 . 1" 
/db_xref="GI: 16359342" 

/^-^ xr ^^"^° C ^^p^ V g^g^pj^jYj^jLKDNLAILEKQDKKTDKASEEVSKSLO 

STFDIASDAFATFKDLLTRHKVLVADFLEQNYDTIFEDYEKLLQSEN 
mvD^CT^^l'cE^LDRHNFTIMTKYTSKPENLKLMMNLIjRDKSPNIQFEAFHVFKV 

^IsPnKX^ 

It ... 

BASE COUNT 418 a 301 C 294 g 346 t 

ORIGIN 

84 9%; Score 860.4; DB 10 ; Length 1359; 

Query. Matcn _ 3-2e-200; 



innmiiiTmiiTiiniiiiiiiinimiMiiii 



53 ftTGAAAAAAATGCCTTTGTTrAGTAAATCACACAAAAATCCAGCAGAAATTGTCAAAATT 312 

„ ctgaaagacaai™ggca™i" 120 

ul GAAGTGTCTA^TCA^^ 180 

3,3 LgUILJJM^^ " 2 

„3iMiGiiociiS 

241 ACACT3ATAGCTGACCTGCAGCTG— ~ "» 
301 1'^^AAGAACATCTTGAGAAGACAGATAGGCACTCGGAGTC^ 

2 6 , "CTCATCCTGATATCCTGTTTATGC^^ «<> 

, . , GOTT^GGGATTATGCTGAGAGAAT^^ > =« 

;:: u c isiM^ 

. ,., ^TTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCT 540 

341 TCAGATGCCTTTGCTACTTTCAAGGATTTACTAACCAGAGATAAAGTC «~ 

„3 TCAGATGCCTTCGCTACTTTTAAGG^^ '* 
TTCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGAMTTGCTTCAGTCTGAG 660 

;;; uss^^ - 

56 1 AATTATOTTACTAAGAGACAGTGTTTAAAGGTGGTAGGGGAGGTGAT^ ™ 

::: ii^MM « 

U AACTTTGCCATCATGACAAMSTATATCAGCAAGCCGGAGAAC^ 780 

;: eikm^m - 

7C1 ctcgttggggataaaagtgcgaagatcgagtttgaaggctttcatg™ 840 

8 ,i gtggccagtggtgacaaaacaca^ »•» 

1033 GTGGCCAGCCCCCACwJ^ACGCAGrc 



RESULT 8 
BC016546 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

.JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REMARK 
COMMENT 



1M0 bo mRNA linear ROD 16-APR-20C3 

™L» B rikew c o» 9 e„e, — .«» cao„e „ 0 C, 2 ^ 

IMAGE: 3595339) , complete cds . 

BC016546 

BC016546.1 GI : 16741456 

MGC. , 

Mus musculus (house mouse) 

MUs musculus rvaniata- Vertebrata; Euteleostomi ; 

EES' S— i,- — • Murinae, Mus. 

itrS:^ .:ieUi d .--^^ ; -i^?;w.a. D . 

Klausner.R.D., Colons ,F . S . Wagner, L schaef er , C . F • , Bhat,N.K., 
Altschul.S.F.. Zeeberg.B Bu f ow ' K ^ ' s f ; Wang , j. , Hsieh.F., 
;I opkins,R.F,, Jordan, H Moore^, ^"^.^V H ong,L. , 

Stapleton.M.. Scares ,n .» , UsdjnT .B.. Toshiyuki , S . , 

Scheetz^E. . ^rr^? V — ' 

Carnxncx,P-, Prang C , Bosak , s . A ., McEwan,P.J., 

,.bramson,R.D Gunaratne ,P.H. , Richards, S 

McKernan , K . J - , MaJ.eK,o , Hulyk,S.W., 
Worley,K,C, ^ le ' S ''^ C1 ^ e g ; e ; d : : 'Lu,X.! Gibbs,R.A., 
Villalon.D.X. . ^'^ tem ^ rgr M adan,A. , Rodrigues,S. , 
Fahey,J., Helton, E . , Ketteman, shevchenko , Y . , 

Sanchez , A. , Whiting, M Mad an, A - Green , E . D ., 
Bouffard,G G. • Blakesley^- , , Schmutz , J - , Myers,R.M. 

Dickson, M.C., Rodriguez, A. L.,^ skalska;U ., Smailus , D . E . , 
3utterfield,Y.S., Krzywmski MI., Skals > 

Schnerch, A. , Sc hein < * f ^fthan 15,000 full- length 

Generation and initial andiy^ 

r,?r"ci a Ti°r% 9 «... 

22.388257 
12477932 

2 (bases 1 to 1530) 
Strausberg,R. 

ljSA • ttpt ■ ht to -//mqc. nci.nih.gov 

NIH-MGC Project URL. nctp.//iy 

Contact: MGC help desk 

Email- cqapbs-r@mail-nih.gov 

Sssue Procurement: Jeffrey Green M - D . 



C0» f' P "r^" : THi £ I St ^Consortium UUL) 

^WmX^ - Human « 

Sequencing Center 
rpnter code: BCM-HGSC 

.5S Ste : http : / / www . hgsc . bcm . tmc . edu/cdna/ 

Contact: amg@bcm.tmc.edu Hulyk, S.W.. Leulseged, H. , 

Gunaratne, P.H., Garcia, A Lu, X H« y, Nanavati> 

Kowis, C.R., Sneed, A.J., Martin, 
A.N., Gibbs, R.A. 

frrouuh the I.M.X.G.B. consortium/LLHL at . ntcp 
Series, « "J^ectedTo; £»n "e„ g th sequencing because it 
StsefS rilSnf-^tion criteria. Similarity »ut not 
identity to protein. 
FUTURES Location/Qualifiers , ; 

1. .1530 

/organism="Mus musculus" 
/ m ol_type="mRNA" 
/strain="FVB/N" 
/db xref="taxon: 10090" 

/clone-MGC: 27972 IMAGE^3595339 _i]i a q mode l. Infiltrating 
/tissue type = "Mammary tumor. -3U) lay 
ductal cartinoma. 5 month old virgin mouse." 
/clone_lib-- -NCI_CGAP „Jtem6 " 
/lab host= 'DH10B" 
/note=" Vector: pCMV- SPORTS". 

/gene"lS00031K13Rik^ '■ 2810 425O13Rik" 
/note=" synonyms : 49iUb^ULUDKi^< 
/db__xref = "LocusID -.69008" 
/db_xref="MGI:1916258" 

;»7 9 . .1283 

/codon_start=l i 
/product="1500031K13Rik protein 

/protein id= " AAH16546 . 1 » 
/db_xref="GI: 1674145"" 



'jour: 



/db_xref = : LocusID:69008^ ^^^kqdkktdkaSEEVSKSLQ 
/ trans lat ion= MPLFSKSHKNPAEIVKi EGKKDVXQI FNN i L 

QFRDFFKYVELSTFDIASDAFATF^LLTRH^ 



BASE- COUNT 498a 313c 326 g 393 t 

84.7%; Score 858.8; DB 10; Length 1530; 



Query bV\\:oh 
Bc^t Lc 
Matches 

Qy 

Db 



Query M<--5:c:i q a'a%- Pred. No. 8e-200; 



61 CTGAAAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 120 

MUM I! Ml 1 1 1 1 ! 1 1 1 1 1 i 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 ! 1 1 1 ! i 1 1 1 ! 1 1 M 1 1 

3 30 CTGAAAGACAACCTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 38 9 
12V GAAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA 18 0 

" || Mill II HI I I M MM II II I II II M M 

3 9 0 GAGGTGTCAAAATCTCTGCAAGCAATGAAGGAAATTCTGTGTGGAACGAACGACAAGGAG 44 9 
1 8 1 CCCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 24 0 

Mill M I II I II I II I M 1 1 M 1 1 II II M 1 1 M 1 1 i 1 1 1 1 II MM Ml 

450 CCCCCTACAGAAGCAGTGGCTCAGCTGGCGCAGGAGCTCTACAGGAGCGGGTTGCTGGTG 509 
-> '1 ACACTGATAGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATA 30 0; 

Mill 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 II II I M II II I III I II II 1 1 II 1 1 1 1 M M II 

5 1 0 ACACTCATAGCTGACCTGCAGCTCATAGAC.TTTG AGGGAAAAAAAGATGTGACCCAGATA 569: 
3 0 1 TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGT 360. 

' • M IIIMIMI 1 1 1 1 1 1 ! i 1 1 1 1 ! M M Ml IMIIMIM Mill II Ml 

57 0 TTCAACAACATCCTGAGAAGACAGATTGGTACACGGTGTCCtACTGTCGAGTACATCAGT 62 9 
3 61 GCTCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 42 0 

MM MUM I II I MM II M 1 1 I MM III 1 1 1 M 1 1! II 1 1 II II I i II II II 

0 TCTCATCCTCACATCCTGTTTATGCTTCTCAAAGGCTATGAAGCCCCACAGATTGCCTTA 689- 
^'GTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTC 4 80.: 

' ! | |l II II M 1 1 II 1 1 Mill 1 1 1 1 i 1 1 1 1.1 i II i 1 1 1 1 1 1 1 1 1 M ! 1 1 1 1 1 N 1 

; <,o CGCTGTGGGATTATGCTAAGAGAGTGTATTCGACATGAGCCACTTGCCAAAATIGATCCTA 74,9 
, , (1 •r'XTTCTAATf 1 AATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGGT 54 0' 

11111111.111 I II Mi I II HIM Mill i I Ml MM II II II I Ml Ml 

750 TTTTCTAATCAGTTCAGAGATTTCTTCAAGTATGTTGAGCTGTCCACCTTTGATATCGCT 909- 
- •' "> TCAGA^GCCTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGAC 600; 

1 1 1 ii T 1 1 1 1 ! 1 1 1 1 i 1 1 1 llllllll I M I M M II II II M I II M II II I i I 

oiO TCAGATGCCTTCGCTACTTTTAAGGATTTGTTAACCAGACATAAAGTATTGGTAGCAGAC 869- 
,. nl xTCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAG 660 

MMIMMMIIMM II 1 1 1 II 1 1 1 M 1 1 II II II M II I II MM M MMII 

8 7 0 TTCTTAGAACAAAATTATGACACTATTTTTGAAGACTATGAGAAACTGCTGCAATCTGAG 92 9 
* i - 1 AATTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGA.CCGTCAC 7 2 0- 

M MM I Mill I IIIMIMI Mill III III 

S 3.0 AACTATGTGACAAAGAGACAATCTTT AAAGTTGCGAGGTGAGCTGATCCTGGACCGCCAC 9 89 



•-, i ab CTTTGCCATCATGACAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAAC 7 8 0, 

" || |i MM Mill 1 1 II II 1 1 II I II 1 1! I I II i I M 1 1 II II I IIIMIMI 

99 0 AATTTCACCATTATGACCAAGTATATCAGCAAGCCAGAGAACCTGAAACTGATGATGAAC 



1049 



-fn CTCCTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTT 840. . 

' ' I Ml Ml II I M I ! 1 1 1 1 1 M 1 1 1 ! I M llllllll Mill Mill MMII I 

1 0 5 0 CTGCTTCGAGACAAAAGTCCCAACATCCAATTGGAAGCCTTCCATGTCTTTAAGGTGTTT 110 9 

84-' GTGGCCAGTCCTCACAAAACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAA 900 

' MMMM M llllllll llllllll llllllll 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 ^ 11M 

1110 GTGGCCAGCCCCCACA AAACGCAGCCTATCGTGGAGATTCTGTTAAAAAATC AGCCC AAA 116 9 



Qy 

Db 

Qy 

Db 



90] 



1170 



CTCATTGAGTTTCTGAGCAGCTTCCAAAAAGAAAGGACGGATGATGAGC AGTTCGCTGAC 960 

i,, mi ii iiiiiiMiii ii ii 1 1 1 1 1 1 1 1 mill iooq 

CTCATTGAGTTTCTGAGCAGCTTTCAGAAAGAAAGG 12 2 9 



1014 



961 QAGAAGAACTACTTGATTAAACAGATCCGAGACTTGAAGAAAACGGCCCCTTGA 

I Mill I Mil 1 1 1 1 1 1 1 1 1 Mill MMI I Mill Ml 

12 3 0 GAGAAGAACTACCTGATTAAACAGATTCGAGACTTGAAGAAAGCAGCCCCGTGA 12 83 



RESULT 9 
BD14.7463 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 



COMMENT 



FEATURES 
sour.; 



BASE COUNT] 
ORIGIN 



Craniata ; Vertebrata ; Euteleos tomi ; 
Catarrhini ; - Hominidae ; Homo . 



BD147463 822 bp DNA linear PAT 17-JAN-2003 

Primer for synthesizing full-length cDNA and use thereof. 
BD147463 

BD147463.1 GI:27853221 
JP 2C02191363-A/2306 . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chorda ta; 
Mammalia; Eutheria; Primates; 
1 (bases I to 822) 

Ota,T., Isogai # T. # Nishikawa, T . , Hayashi,K., Saito,K., Yamamoto, J. / 
Tshii,S., Sugiyama,T., Wakamatsu, A. , Nagai,K. and Otsuki,T. 
Primer for synthesizing full-length cDNA and use thereof 
Patent: JP 2002191363-A 2306 09- JUL-2002 ; 
.KELT \ RESEARCH INSTITUTE 
OS Homo sapiens (human) 

JP 2G02191363-A/2306 . 
09--JUL-2002 

28-JUL-2000 JP 2000230990 

TOSHIO OTA , TAKAO ISOGAI , TETSUO NISHIKAWA, KOJI HAYASHI , KAORU 
SAITO 

JUNICHI YAMAMOTO, SHIZUKO T.SHI I . TGMOYASU SUGIYAMA , A_I WAKAMATSU , 
KEIICHI NAGAI , TETSUJI OTSUKI 



?D 
:?V 
PI 
PI 
?I 
PI 

C12N15/09 / C07K14/47,C07K16/18,C12Nl/l5 , C12N1/ 19 , C12N1/2 1 , C12N5/ PC 

PC°' C12^2l/02 # C12Ql/68//C12P'2l/O8,GO6F17/3O # C12N15/00' f C12N5/OO CC 
primer for synthesizing full-length cDNA and use thereof FH Key 
Location/Qualifiers 
FT source 1- .822 

- T /organism='Homo sapiens (human) ' . 

Location/Qualifiers 
1. .822 

/organism=' f Homo sapiens" 
/mol__type= "genomic DNA" 
/db_xref = " taxon : 9606 " 
268 a 164 c 171 g 216 t 3 others ■ • 



Query Match 76.0%; 
Best Local Similarity 98.5%; 
Matches 798 ; Conservative 



Score 770.6; DB 6 
Pred. No. 3 .4e-178 
0; Mismatches 10 



Length 822; 



Indels 



2; Gaps 



Qy 



Db 



78 



i TTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATCCTGAAAGACAATTTGGCC 

illllilllllllllllllMIIMIIIIillllllMIIIIIIIIIMIMIIIMMI 

1 TTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATCCTGAAAGACAATTT3GCC 60 



7 9 ATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCA^^ 138 
61 ATTTTGGAAAA 120 



Db 



i-« rAA-CAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAACCCCCAACAGAAGCAGTG 198 
Qy l,,9 CAAGCAATGAAAGAAAT | | | I I I I I I I I I I M 

121 CAAGcUIgA^^ 180 
199 GCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTGACACTGATAGC^ 258 

Qy 19 J GCICA^ ...... , | | , , . | I | | | | | | | | | | | | | | | | I I I I I I I N M I I 

,81 GCTCAGCTA^ 240 
r ' CAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATATTTAACAACATCTTGAGA 318 

Qy ,b9 v,AGCTGATAGACT , , , , , , , , , , , | , , | | , , ! | i | | | | | | j | | | | | | | | | | I I I I I I I 

241 CAGCTGATAGACTTTGAGGGAA 3Q ° 

T 1 C -T ftGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGTGCTCATCCTCATATCCTG 378 ' • ' 

3 " M M M If M 1 i i II 1 1 II I i I ! 1 1 III 1 1 1 1 ill 1 1! 1 1 1 JiJI I.'' 'L i ' >;i ™ , . 



Db 



'! C 1 j^GACAGATAGGCACTCGG 360 , ^ 

,79 T TTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTACGTTC 438 , 

.i.i. I I I II I | I | I | I | | | I | | I I I I I I I I I I I I ' ' ' ''''''''' ' ™ • r< 

, fl iiTTATGCTCCTCAAAGGATATGAA 420 ' j& 

j AGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTCTTTTCTAATCAArn 498- " " ^ 

i| ! i I 1 1 M i 1 1 I 1 1 1 1 I Ml I I I I i I M I I I I I I II ' ' 1 1 I ' I 1 1 ' ' ' -4 

-J s ,;u IUgAATGTaU^ 480 ' 



Qy 
Db 

QY 

Cb GO.l 



Db. 
Db 

Qy 
Db 



-A-TTCTTTAAGTACOTGGAGTTGTCAACATTTGATATTGCTTCAGATGCCT^ S»- < ^ 

kl ttcAMSGATTTACT 

- in GACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAGAATTATGTTACTAAGAGA 67-8. 
^ACTATTTTTG . , , , |( , , , , , , | | | , | || | , | | | | | | | . j | | | | | | | | | | I I 

gag actatttttga! GACTATGAGAAATTGCT 



^ Q rArTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCACAACTTTGCCATCATGACA 73 8 
Qy : ,,9 CAGTCTTTAAAGCr o C , , , , , j, , , , , , , | , , | | j | , | , | | | | | | | | | | | | | | | | | | 



/T , AAGTATATCAGCAAGCCGGAGAAGCTGAAACTCATGATGAACCTCCTTC 798 

? : til I II i ll lil IMIII II Mill M II 1 1 III I MIIMIII HIM 

721 j^GTATATCAGCAAGCCGGAG 778 
7 99 CCCAACATCCAGTTTGAAGCCTTTCATGTT 828 



77 9 CCCAACATCCAGTTTGAGCCTTCTGGTTTT 8 08 



RESULT 10 

BP079551 linear PAT 27-AUG-2002 

LOCUS BD07 9551 831 bp una 



DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 



Cancer-associated nucleic acids and polypeptides 
BD079551 

BD07 9551 -1 GI : 2262 5154 
JP 2001516009-A/.217 . 
Homo sapiens (human) 
Homo sapiens 

Chordata; 
Primates ; 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



COMMENT 



PN 
PD 
PF 

PR 



US 60/061599 PR 

08/948705 PR 

09/102322 PI LLOYD 
PI TSENG 



?^K:TURES 

source 



BASE COUNT 
ORIGIN 



Eukaryota; Metazoa; 
Mammalia; Eutheria; 

1 (bases 1 to 831) , 
Old,L.J., Scanlan,M.J. , Stockert,E., Gure,A. # Chen,Y.T., Gout, I., 
OGhare,M., Obata,Y., Pf reundschuh, M . , Tureci,0. and Sahm,U. 
Cancer-associated nucleic acids and polypeptides 
Patent: JP 2001516009-A 217 25-SEP-2001; 
LUDWIG INSTITUTE FOR CANCER RESEARCH 
OS Homo sapiens (human). 
JP 2001516009-A/217 
25-SEP-2001 

15-JUL-199S JP 2000503425 

17-JUL-1997 US 08/896164, 10-OCT-1997 

10 - OCT- 1997 US 60/061765, 10-OCT -1997 US 

11- OCT-1997 GB 9721697 . 2 , 22 -JUN-1998 US 

J OLD , MATTHEW J SCANLAN , ELISABETH . STOCKERT , AL I GURE , YAO 
^HEN 

PI 'iVAN GOUT , MICHAEL O' HARE, YUICHI OBATA, MICHAEL " PFREUNDSCHUH , PI 

OZLEM TURECI, 
yl UGUR SAHIN 

V.O?.N33/574,A61K38./00,A61K39/395 > A61K39/3S5,A61K45/00,A61K48/00, PC 
\6 ? P3 5/00 

PC . C07K14/82 / C07K16/32',C12N15/09//C07K16/46 / C12P2l/08,A61K37/02 / 
C12N15/00 : 

Cancer-associated nucleic acids and polypeptides. .FH Key 

Location/Qualifiers 
source 1 . '. 831 

/ organ ism= 'Homo sapiens (human)*. 

Location/Qualifiers 

1. .831 r ' . 

/organism="Homo sapiens" 
/mol_type= "genomic DNA" 
/ db_xr e f = " t axon : 9 6 0 6 " 
285 a 165 c 167 g 209 t 5 others 



PC 
CC 

FT 



Query Match 67 .5%; 

Bes-t Local Similarity .96.1%; 
Matches 764 ;.. Conservative 



Score 684.6; DB 6; 
Pred. No. 4.1e-157; 
0; Mismatches 23; 



Length 831; 
Indels 8; 



Gaps 



6, 



Qy 

Db 

Qy 
Db 
Qy- 



ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATC 

TTTTTmMM 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 » 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 II 

ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATC 



60 



96 



<i PTGAAAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 120. 

" i||i|||||||l|||||IMMIIIIMIIIIIIII!MMMMIMMIIIIIillM 

97 ctgAaAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 156 



1 ? n GAAGTGTCTA^ATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA 

miilMIIMillllllllllillllllllllllllllllllllllllilllllMII 



180 



Db 157 GAAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA 216 

Cv 18' CCCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 240 

' iiiiiiillllllllllllllllllll ihlMMIIIIIIIIIIMIIIIMMNIII 

Db 217 CCCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 276 

rw 2»3 ACACTGATAGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATA 300 

" IIMIIMIIIIIIIIIMMIIIIIIIIMIIIIIIMIIIIIIIMIMIMMIIII 

Db 277 ACACTGATAGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATA 336 

™i TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGT 3 60 

Qy 301 TTTAACAACATC | | | | | | | | II I I I M I I I ! I M I I II M I I I I I I I I 

Db 337 TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGT 3 96 

Ov 3 61 GCTCATCGTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 420 

Q/ liiiiiMiillllllllllllillliriiillMIIIIIIIIIIIIIMIIIIIIIIII 

Db 397 GCTCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 456 

r v 421 CGTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTC 4 80 

Tm Ml II I II II II I II I M I M II II 1 1 M Mil I i M II I Ml II i 1 1 1 II II MM 

Db - 457 CGTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTC 516. 

r,-, "81 TTTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCT 540 

Qy ,81 TTTTCTAATCAA ( j |- | | | | | II I I I! M I I I I I II I I I I I II I II ! I I 

Db -, 7 XTTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCT 576. 

, rt- 1 AGATGCCTTTGCTACTTTC AA - GGATTTACT AACCAGACATAAAGTGTTGGTAGC - AG 5 98; 

"' M||| MUM Mill I Mill I II II MM MM 1 1 II MM II II M Ml MM II 

- rCAGATGCCTTTGCTACTTTCAAGGGATTTACTAACCAGACATAAAGTGTTGGTAGCAAG 5 36 ■ 
r :,rj ^CTTCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTGAGTCTG 65* 

M IMIMMMMMIMMMIM II 1 1 1 1 1 II I M 1 1 1 M I II I II II II I II I 

63 7 AcTTCTTAGAACAAAATTACGACACTA^ 5 96 

,-- 9 AG - AATTATG^TACTAAGAGACAGTCTTTAAAG - CTGCTAGGGGAGCTGATCCTGGACCG 7 16 

M III MMIM 1 1 1 1 M 1 1. 1 1 1 IMIM Mill Mill llllllll 

697 AGAAATTATGTTACCAAGAGACAGTCCTTAAAGCCTGCTAAGGGAACTGATTCTGGACCG 7 56 
- 1 7 TCACAACTTTGCCATC-ATGACAAAGT ATATCAGCAAGCC - - - GGAGAACCTGAAACTCA 772 

Ml Mill Mill II I I Mill Mill MM M Mill II I 

757 TCANAACTTTGCCATCAANGCAAAAGTTTATCAACAAGCCNGGGGAAACCGGAAACNCAA 816 
r, v 773 TGATGAACCTCCTTC 787 

II Mill II MM 

Db a 17 GGAGGAACCTCCTTC 831 



Db 

Qy- 
Db 
Cv 
Db 



RESULT 11 

AX061831 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



AX061831 
Sequence 
AX061831 
AX061831 . 



1026 bp 
from Patent WO0078947. 

GI : 12539911 

(human) 



DNA 



linear PAT 24-JAN-200 



Homo sapiens 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 



Craniata; Vertebrata; Euteleostomi ; 



Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

RE IuTHoS den Daas,I., Fischer, v., Seyfried,C. and von Melchner.L. 
TITLE Head trauma induced cytoplasmatic calcium binding protein 

JOURNAL Patent: WO 0078.947-A 1 28-DEC-2000; 
MERCK PATENT GmbH (DE) 
FEATURES Location/Qualifiers 
source 1- -1026 

/organism="Homo sapiens" 
/mol_type= "genomic DNA" 
/ db_xr e f = " t axon : 9 6 0 6 " 
qpo 1. .1026 

/note="unnamed protein product" 

/codon_start=l 
/protein_id="CAC25030.l" 
/db xref="GI: 12539912" 

'translations "MPFPFGKSHKSPADIVKNLKESMAVLEKQDISDKKAEKATEEVS 
KNLVAMKEILYGTNEKEPQTEAVAQLAQELYNSGLLSTLVADLQLIDFEGKKDVAQIF 
NN1LRRQIGTRTPTVEYICTQQNILFMLLKGYESPEIALNCGIMLRECIRHEPLAKII 
LWSEQFYDFFRYVEMSTFDIASDAFATFKPLLTRHKLLSAEFLEQHYDRFFSEYEKLL 
HSENYVTKRQSLKLLGELLLDRHNFTIMTKYISKPENLKLMMNLLRDKSRNIQFEAFH 
VFKVFVANPNKTQPILDILLKNQAKLIEFLSKFQNDRTEDEQFNDEKTYLVKQIRDLK 

RPAQQEA" 

BASS -COUNT 359 a 199 c 203 g 265 t 

ORIGIN 

-V Hatch 57.5%; Score 582.6; DB 6; Length 1026; ><>■•■ 

- Lo~a* Similari^'. 74.7%; Pred. No. 4.3e-132; ■•><?•-. 

ches" '?4V; Conservative 0; Mismatches 244; Indeis 9 ; 'Gaps. ';■ 

13 GTTTAGl'AAKTCACACAAK^TCCAGCAGAAATTGTGAAAATCCTGAAAGACAATTrGGC 77 

Til 1 M Ml I Ml I MM MM I I Ml MM I MMI M I MM 
12 gtttgggaagtCtcacaaatctccagcagacattgtgaagaatctgaaggagagcatggc 71 - 

78 CATTTTGGAAAAGCAAGAC - - ■ - - "^^^^^^9^9^^^^?^T^^^^^^^^^^T^T? 

ll 1 1 1 1 1 1 1 1 1 1 1 1 1 ■ MMI I M I I II I M i M II I M M 1 1 

72 TGTTCTGGAAAAGCAAGACATTTCTGATAAAAAAGCAGAAAAGGCTACAGAAGAAGTTTC 131 
••2 9 TAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAACCCCCAAC 188 ■ 

Ml ' Ml M Ml II MM i MM M Ml Mill M Mill II I M 

132 CAAAAATCTGGTTGCCATGAAAGAAATTCTGTATGGCACAAATGAAAAAGAGCCTCAGAC 191- 
Ov '■89 AGAAGCAGTGGCTCAGCTAGCACAA3AACTCTACAGCAGTGGCCTGCTAGTGACACTGAT 248 

y 1 ill Ml II Mill M II III Mill Ml I Mill M M M III I 

152 AGAAGCAGTAGCTCA 251 
-A9 AGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATATTTAACAA 30.8, 

' MUM I Mill II I Mill Mill II III III III I M M II Mill 

252 AGCTGATTTACAGCTCATTGACTTTGAGGGCAAAAAAGACGTG.GCTCAAATTTTCAACAA 311 
' ,09 CATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGTGCTCATCC 36 8' 

' ' || l MMMM II II M I I lill Hill M II II I I II I 

3 12 TATTCTCAGAAGACAAATTGGTACGAGAACTCCTACTGTTGAATACATCTGCACCCAACA 3 7 1 
0y 369 TCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTACGTTGTGG 428 

9i nil MM III I I 1 1 1 1 1 1 1 i 1 1 1 I HI I H M II II II II 

D v V2 GAATATTTTGTTCATGTTATTGAAAGGGTATGAATCTCCAGAAATAGCTCTAAATTGTGG 431 



Db 



Db 



Qy 

Db 

o.y 

Db 



oy 42.9 GATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTCTTTTCTAA 48B 

II Ml I Mill II I II II I II Mill I II H UH M II I 

Db 432 AATAATGTTAAGAG^^TGCATC 491 

Qy 439 TCAATTCAGAGATTTCTTTAAGTAGGTGGAGTTGTCAA 548 

1 1 1 1 1 1 1 II M I I I II I ! II II II I II 1 1 II 1 1 1 1 M I II II 

Db 492 ACAGTTTTATGATTTCTTCAGATATGTCGAAATGTCAACATTTGACATAGCTTCAGATGC 551 

Ov 549 CTTTGCTACTTTCAAGGATTTACTAAGCAGACATAAAGTGTTGGTAGCAGACTTCTTAGA 608 

7 mil II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II I Ml Ml II I Mill M II M 

532 AT Ug<2CACATTC^ 611 
.09 ACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAGAATTATGT 668 

in M 1 1 II I 1 1 1 1 1 1 M 1 1 1 M I 1 1 M M ! 1 1 M M II II 1 1 

G12 ACAGCATTATGATAGATTTTTCAGTGAATATGAGAAGTTACTTCATTCAGAAAATTATGT 671 
.69 TACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCACAACTTTGC 728 

M M Mill Ml I Mill II I! M II I M II I I MM II I I 

672 GACAAAAAGACAGTCACTGAAGCTTCTCGGTGAACTACTACTAGATAGACACAAC^ 731 
729 CATCATGACAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAA 

II I M I M M II Mill M M III I m 1 1 1 I II 1 1 I I II 

z^TTATGAGAAAATACATC "1: 
7H9 GGAT^GTCCCAACATCCAGTTTGAAGCCTTTCATG™ B48, 

M 1 1| || I i I II I II I M M I II I I i I II II 1 1 1 1 II II M MM 

..... ^GACAAiJ^GTCGCAACATC^ 851 
' V. TCCTCACAAAACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAAG^CATTGA 908: 

- mm UN II MM I M r 1 1" Mill i II M Ml Ml Mill II M 
tcctaIcaagacgcaGc^ 

'W 9 GTTTCTGAGCAGCTTCCAAAAAGAAAGGACGGATGATGAGCAG^ 968 

" : " ill M MM II Mil M I II 1 1 1 M M I II 1 1 1 II I M 1 1 1 1! I II 

512 otIcCTCAGCAAGOTTCAGAACGACAGGACGGAGGATGAGCAGTTTAACGACGAGAAGAC 971 
'-'69 CTACTTGATTAAACAGATCCGAGACTTGAAGAAAACGGCCC 1009 

IM M IMMMMM ! M II III i I MMI 

9 72 CTATTTAGTTAAACAGATCAGGGATTTGAAGAGAGCAGCTC 1012 



Db 

QY 
Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Db 



Db 

*jBCT£T 12 

AM082322 linear PAT 28-FEB-200 

:r.QCUS AX082322 3281 bp una 

DEFINITION Sequence 26 from Patent WO0111032. 

ACCESSION " AX082322 

VERSION AX082322.1 01:13184499 
KEYWORDS 

c OUR r, E Homo sapiens (human) 

ORGANISM H°- r -P^- efcazoa; chordata; craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhin.L; Hominidae; Homo. 

RE aShORS iodqsonD,M.. Lincoln, S . E . , Russo,F.D. ( Spiro,P.A., Banville, S . C . , 

AUTHORS R '.. ^^.^ Co hen,H.J., Rosen, B . H . , Chalup.M.S.. 

Killman,J.T,., Jones, A. I.., Yu,J.Y. f Greenawalt , L . B . , Panzer, S.R., 



Rcseberry, A.M. , Wright, R.J. and Daniels, S.E. 
TITLE Secretory molecules 

JOURNAL Patent: WO 0111032 -A 26 15 -FEB- 2 00 1 ; 
Incyte Genomics, Inc. (US) 
FEATURES Location/Qualifiers 
source 1- .32 81 

/organism="Homo sapiens" 
/ mo l_type= "genomic DNA" 
/db_xref = " taxon : 9606 " 
/note=" Incyte IDNo: 481257.3" 
BASE COUNT 1014 a 601 C 676 g 990 t 

ORIGIN 

n u ,rv M^ch 57.5%; Score 582.6; DB 6; Length 3281; 

Best Local Similarity 74.7%; Pred No. 4.3e-132; 

Matches 748; Conservative 0; Mismatches 244; Indels Gaps 
Ov 3 8 GTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATCCT 77 

Qy . 1 1 1 1 I II II 11 II 1 1 1 1 1 1 1 1 1 1 1 II II I II I I MINIM Ml! 

■103 GTTTGGGAAGTCTCA^ 160 



Db 

Qy 

Db 



78 CATTTTGGAAAAGCAAGAC AAAAAGACAGAC^GGCTTCAGAAGAAGTGTC 128 

II Ml II! I II II III HI 1 1 MM HUM III IN 1 1 MM 

, 61 TG || c iii^GCAlGACATTTCTGATAAAAAAGCAGAAAAGGCTACAGAAGAAGTTTC 220 
Qv TAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAAC^ 188 

Mi in || I II II II I II M MM 1 1 1 M 1 1 1 M I II II 1 1 I M 

-1 CAAAAATGTGGTTGCCATGAAAGAAATTCTGTATGGCACAAATGAAAAAGAGCCTCAGAC 28.0, 
AGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTGACACTGAT 248: 

""' rill I MM Mill II Mill II I Ml M I MM M M 1 1 M I I. 

AGAAGCAGTAGCTC^CTTGCXCAAGAAGTC 3 ^ 
,0 AGGTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATATTTAACAA 308, 

' 1 1 1 j 1 1 I HIM II IMMIMMI MINIMI! I M II II Mill 

,41 AGCTGATTTACAGCTCATTGACTTTG 

V09 CATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGTGCTCATCC 368 

" II I MINIM II M II I i INI III II II II II I I N I 

,01 TATTCTCAGAAGACAAATTGGTACGAGAACTGCTACTGTTGAATACATCTGCAGCCAACA 460 
Ov ' 369 TCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTACGTTGTGG 428 

Y 1 1 1 1 MMIIM I II I M 1 1 H 1 1 r I II I 1 1 1 1 N MINI 

GAATATTTTGTTCATGTTATTGAAAGGGTATGAATCTCCAGAAATAGCTCTAAATTGTGG 520 



Db 

Qy 

Db 

Qy 
Db 



Db 461 



O-, GATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTCTTTTCTAA 488 

ll ill I I II II II 1 1 III INI II INN II HI II III II N I 

Db 521 AATAATGTTAAGAGAATGCATCAGACATGAACCACTTGCAAAAATCATTTTGTGG 580 

489 TCAATTCAGAGATTTCTTTAAGTACGTGGAGT.TGTCAACATTTGATATTGCT 548 

II 1 1 I II I ! 1 1 1 I UN I II II I II N 1 1 1 N II II N IN 1 1 

r q ■■ ACAGTTTTATGATTTCTTCAGATATGTCGAAATGTCAACATTTGACATAGCTTCAGATGC 640 



Qy 

Db 



.4 9 CTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGACTTCTTAGA 

" Mill II 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 I Ml II INI I N I Mill N M 



608 



Db 



6 „, ™-™^rin™m 668 

, 29 CATCATGACAAAGTftTATCAGCAAGCCGGAGAACCTGR^CTCAMATGMCC 788 
789 GGATAAAAGTCCCAACATCCAGTTTGAAGCCTITCATGT^ «• 

.„ -ctgacaaaagagaggctattgtggaga™^ 

M1 rJ-ccTAACAAGACGCAGCCCATGCrAGACATCCTC V- 
S09 GTTTCTGAGGAGGTTGCAAA^GAAAGGAGGGATGA— 

„, ;1 gttcctcJ^gcaagt^cagaIcgacaggIosgagga »« 
,« c»cttgattaaacagatgggagac™aagaaaacggccc ioo» 

iM1 l^LS^ 



Qy 

Db 



Db 

qy 

Db 

Qy 



Qy 

Db 



mm™ cnn 3761 bp mRNA linear PRI 22-JAN-2002 

^ 2057 °. MO?r Drotei n Clone MGC:21*31 IMAGE : 43 97573, . mRNA, 

:fe^SNIXXON Homo sap.iem;, M02!> protein, 
complete cds . 

ACCESSION BC020570 

VERSION BC020570.1 GI : 1B0882*0 

KEYWORDS MGC. 

iCURCS Homo sapiens (human) 

ORGANISM Homo sapiens rr . ■ ata . Vertebrata; Euteleos tomi ; 

sswss; s:; sssa., — 

REFERENCE" 1 (bases 1 to 3761) ...... 

?<U.5^ : OPS 3 1 rausberg , R • 

STLE Di l eC lZftTjZ-2002) National Institutes of Health, Mammalian 

JOURNAL Submitted (03 -JAN 2002 Genomics office, National Cancer 

;-» pri p Collection (MCju; , LclilLCi w ^ , » AT . on«Q9-9 c ;90 

institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892 2590, 

USA 

REMARK NIH-MGC Project URL: http://mgc.nci.nrh.gov 
CMM3ST' Contact: MGC help desk 

Email: cgapbs-r@mail.mh.gov 
Tissue Procurement: ATCC 

cPNA Library Reparation -Life Technologies Inc } 

cDNA Library Arrayed by. The 1 - H - A - Q J~ p stan ford Human Genome 
DMA sequencing by: Sequencing Group at ; ne ^anford ^ ^ 

Center, Stanford University School ot M e dl " ne ' 
^ b si ; e: http://www-shgc.stanford.edu 



FEATURES 

source 



Cofitact: (Dickson, Mark) mcd@paxiI.stanford.edu 

Dickson.. M . , Schmutz, J., Grimwood, J. f Rodriquez, A. , and Myers, 
R . M. 

Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov 
Series: IRAK Plate: 2 7 Row: d Column: 16 

This clone was selected for full length sequencing because it 
passed the following selection criteria: Hexamer frequency ORF 
analysis, GenomeScan gene prediction. 

Location/Qualifiers 

1. .3761 

/organisms "Homo sapiens" 

/mol_type= M mRNA" 

/db_xref- M Locus ID: 51719" 

/ db_xr e f = " t axon : 9 6 0 6 " 

/clone="MGC: 21631 IMAGE : 43 97573 " 

/tissue type -"Duodenum, adenocarcinoma" 

/clone_Iib="NIH_MGC_8 8" . 

/lab_host-"DH10B" 

/note="Vector: pCMV-SPORT6" 

325. .1350 

/codon_start=l 

/product="M02 5 protein" 

/piotein_3 d="AAH20570 .1" 

/db xref="GI: 18088261" 

/ translate on= " M P F P FGKSHKS PAD I VKNLKE S MAVIjE KQD I SDKKAE KATEE VS 
KL^LVAMKEILYGTNEKEPQTEAVAQLAQELYNSGLLSTLVADLQLIDFEGKKDVAQIF 
iqNILRRQIGTRTPTVEYICTQQNILFMLLKGYESPEIALNCGI'MLRECIRHEPLAKII- 
LWSEQFYDFFRYVEMSTFDIASDAFATFKDLLTRHKLLSAEFLEQHYDRFFSEYEKLL 
HSFNYVTKRQSTjKLLGELLLDRHNFTIMTKYISKPENLKIiMMNLLRDKSRNIQFEAFH 
VFKVFVANPNKTQPILDILLKNQAKLIEFLSKFQNDRTEDEQFNDEKTYLVKQIRDLK 



"3A^ij CC'Jiv': 
ORIGIN 



RPAQQEA" 
1171 a 709 c 



808 g 



1073 t 



Query Match 57.5%; Score 582.6; DB 9 

-est Lecp.l Similarity 74.7%.; Pred. No. 4.3e-132 
748; Conservative 0; Mismatcnes 244 



Length 3761; 
Indels 9; Gaps 



QV 
Db 



i-8 GTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATCCTGAAAGACAATTTGGC 

mm | 1 1 | ; 1 1 1 1 1 1 MINIMI MINIM I MM! II I Mil 

■= 3 6 GTTTGGG^AGTCTCACAAATCTCCAGCAGACATTGTGAAGAATCTGA^^GGAGAGCATGGC 



77 



78 C A TTTT GGAAAAG C AAG AC AAAAAGACAGACAAGGCTTCAG^Gi^GTGTC 128 

m IN Mill II MM 1 1 1 1 1 MM I II II IN I M 

3 96 tgTTCTGGaUaGCAAGACATTTCTGATAAAAAAGCAGA^ 455 



Qy 

Db 

Qy 

Db 

Qy 



188 



129 tAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAACCCCCAAC 

mi Ml || 1 1 1 1 1 1 M I II II 1 1 1 Mi II I i I i i 1 1 1 1 1 II I 1 1 

,'56 CAAAAATCTGGTTGCCATGAAAGAAATTCTGTATGGCACAAATGAAAAAGAGCCTCAGAC 515 



i 69 AGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTGACACTGAT 24 8 

1 1 M Ml II Mill II M Mill II I IN I INN II II N III I 

516 AGAAGCAGTAGOT 575 
24 9 AGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATATTTAACAA 3 0 8 



MINI I 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 llllllll Ml I M II H MNI fi „ 

Qy 309 CATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGTGC 368 

ii I 1 1 1 1 1 1 1 1 II 1 1 M MUM I MM 

Db 6 , 6 TATTCTCAGAAGACAAATTGGTACGAGAACTCCTACTGTTGAATACATCTG 695 

Cy 369. TCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGC^ 428 

y 1 1 1 1 II I II I II II M M M M M M M M M I M M M M 

Db S96 GAATATTTTGTTCATGTTATTGAAAGGGTATGAATCTCCAGAAATAGCTCTAAATTGTGG 755 

0y 4- GATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTCTTTTCTAA 488 

Qy 1 1 1 1 1 || 1 1 1| I II M I II II II I II M M II II I M M I 

Db 756 AATAATGTTAAGAGAATGCATCAGACATGAACCACT 815 

Qy 489 TCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATA S48 

1 1 1 1 I Ml MM I II II I II M II II II I II M M M 1 1 

Db 816 ACAGTTTTATGATTTCTTCA.GATATGTCGAAATGTCAACATTTGACATAGCTTCAGATGC 875 

Cv 549 CTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGACTTCTTAGA 608 

* Mill || II II II I M M M I M MMMMI I I I MM M 1 1 1 1 

Db 876 ATTTGCCACATTCAAGGATTTACTTACAAGACATAAATTGCTCA 935 

609 ACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAG 668 

in 1 1 1 |i || II 1 1 II M M I II M I M M I II M II II I M I 

,,, ACAGCATTATGATAGATTTTTCAGTGAATATGAGAAGTTACTTCATTC'AGAAAATTATGT 995 : 

TACTAAGAGAGAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCACA^CTTTGC 72*:.. 

" ■ ii I ; imiiiii M I II M I I M i 1 1 I II II i M M II M I 

Dh gIcaaaaagacagtcactgaaggttctcggtgaactactactagatagacacaacttcac 10,5«>- 

■,,9 CATCATGACAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATG^ 788,. 

1 1 iiiiuii || HIM 1 1 M MUM I II M M M II I II M 

i 0 5 6 AATTATGACAAAATAC ATCAGTAAACCTG^ ^ 5 
. 789 GGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTC 84 8 

M MM || I 1 1 1 1 1 1 M 1 1 1 1 1 1 1 MIMMMMMMM MM 

1116 AGACAAAAGTCGCAACATCCAGTTTGAGGCCTTTCACGTTT 1"5 
849 TCCTCACAAAACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAACTCATTGA 908 

1 1 1 1 I II II I II II M I I II IHII M I II II M M II II I II 1 1 

13.76 TCCTAACAAGACGCAGCCCATCCTAGACATCCTCCTCAAGAACCAGGCCAAACTCATAGA 1235 
"09 GTTTCTGAGCAGCTTCGAAAAAGAAAGGACGGATGATGAGCAGTTCGCTGACGAGAAGAA 968 

Ml M MM II M M II 1 1 Mill I IIMII Mill II Ml II III 

Db ' 12*6 GTTCCTCAGCAAGTTTCAGAACGACAGGACGGAGGATGAGCAGTTTAACGACGAGAA.GAC 1295 

„, 9 CTACTTGATTAAACAGATCCGAGACTTGAAGAAAACGGCCC 10 09 

Qi MM II MMMMI MIMI Mill MM c 

Db 12 96 CTATTTAGTTAAACAGATCAGGGATTTGAAGAGACCAGCTC 13 3 6 

RESULT 14. 

SS 824 AF151824 1680 bp mRNA linear PRI 1B-MAY-2000 

DEFINITION Homo sapiens CGI -66 protein mRNA, complete cds . 
ACCESSION AF151824 



r>;- 



Db 



VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



COS 



BASE COUNT 
ORIGIN 



AF151824 .1 GI : 4 92 96 00 

Homo sapiens (human) 
Homo sapiens 

^ukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 1680) 

I,ai,C.H., Chou,C.Y., Ch'ang,L.Y., Liu,C.S. and Lin, W. 

Identification of novel human genes evolutionarily conserved in 

Caenorhabditis elegans by comparative proteomics 

Genome Res. 10 (5), 703-713 (2000) 

20272150 ' 

10810093 

2 (bases 1 to 1680) 
Lin, W. -C. 

Direct Submission . 
Submitted (17 -MAY-1999) Institute of Biomedical Sciences, Academia 
Sinica, No. 128, Sec. II, Academia Road, Taipei 115, Taiwan 

Location/Qualifiers 

1. .1680 

/organism- "Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
i. .1026 
/codon _start=l 
/product="CGI-66 protein" 
/protein_id="AAD34061. 1" 
- /db xref="GI : 4929601" 

/ ran s 1 a t i on = " MP F P FG KSHKS PAD I YKNL KE S MA VLE KQD I S D KKAE KATEE V S 
KN LVAMKE I L YGTNEKE PQTEAVAQLAQEL YNSGLLS TLVADLQL ID FEGKKDVAQ I F 
NNILRRQIGTRTPTVEYICTQQNILFMLLKGYESPEIALNCGIMLRECIRHEPLAKII 
LWSEQFYDFFRYVEMSTFDIASDAFATFKDLLTRHKLLSAEFLEQHYDRFFSEYEKLL 
HSENYVTKROSLKLLGELLLDRHNFTIMTKYISKPENLKLMMNLLRDKSRNIQFEAFH 
VFKVFVANPNKTQPILDILLKNQAKLIEFLS.KFONDRTEDF.QFNDEKTYLVKQIRDLK 

RFAQQEA" 

540 a 324 c 358 g 458 t 



nuerv Match 57.4%; Score. 581.6; DB 9 

Stest" Local Similarity 74.7%; Pred. No. 7.5e-132 
Matches 747; Conservative 0; Mismatches 244 



Length 1680 ; 

Indels 9; ;Gaps ' 1; 



Qy 


19 


Db 


-L 


07 


79 


Db 


73 


Qy 


13 0 


Db 


13 3 




1.9.0 


Db 


193 



L 11 r\'j J- rt^-Vtt. X ^rt^r^^^r-Lr-LXTjrT. ^ *~v_., " 

Ml ! II MINIM 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 



MM. II I M 



II II III Mill III 



II II II II II Mill 



•GGAAAAGCAAGACATTTCTGATAAAAAAGCAGAAAAGGCTACAGAAGAAGTTTCC 



132 



AAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAACCCCCAACA 189 

.... , - . i i i i i i l l 1 I I I I I I ill I I I I I I I ''([I ' ' ^ \ \ \ 



II 



I II 1 1 M II 1 1 II I II III Mill 



II 



I MM III Mill M i I I M M 1 1 ! 1 1 1 I III I IN i 



Ov 2 SO GCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATATTTAACAAC 309 

' Mill I 1 1 1 1 1 || i 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 ! i 1 1 I i ! ! II II II I M 1 1 

Db 2 53 GCTGATTTACAGCTCATTGACTTTGAGGGCAAAAAAGACGTGGCTCAAATTTTCAACAAT 312 

310 ATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGTGCTCATCCT 369 

II MM Mill 1 1 I II I I MM MM I II II II I Ml I 

3 13 AtTOTCAGAAGACAAATTGGTACGAGAACTCCTACTGTTGAATAC 372 
-70 cATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTACGTTGTGGG 429 

J 1 1 1 1 M M II I I I II I II 1 1 1 M I I 1 1 1 I 1 1 1 1 N 1 1 1 1 1 1 
3 7 3 ^Uttttgttcatgttattgaaagggtatgaatctccagaaatagctctaaattgtgga .4 3 2 

4*0 ATTATGCTGAGAGAATGTATTCGAGATGAACCACTTGCCAAAATCATCCTCTTTTCTAAT 4 89 . 

'"' i i i M i M I M II i 1 1 II I II 1 1 1 II II 1 1 1 1 I II II I M II II I 

433 atAATGT^AGAgUtGCATCAGACATGAACCACTTGCAAAAATCATTTTGTGGTCGGAA 492: 
490 cAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCTTCAGATC 549- 

!i || M Mill! I II II M II MM I MINI II IMMMIMI 

Db , iSB CAGTTTTATGATTTCTTCAGATATGTCGAAATGTCAACATTTGA 552 

S50 TTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGACTTCTTAGAA 609 

llll || II Ml IN I II 1 1 II II II Mill II I MM! II rl III 

D> TTTGCCACATTCi^GGATT^ 612 ' 



cy 

Db 

Qy 

Db 

Qy 

Db 



Di.3 



Db 



• ■ o ; -AAAATTAC^AC^TATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAGAATTATGTT ,69. 

0 MM M I MM II Mill III' M Mill il M 1 1 1 1 I.I 1 1 

A< --^GAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCACAA.e^TTGCC 7 2-9- C 

" iV: i I II I II 1 1 I Mill Mil MM Mill I 1 1 1 1 1 1 1 1 I ' 

,..73. IcAAAAAGACAGTCACTgJ^ 7327 ' 

:3C XTCArGACAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAACCTCCTTCGG 7 8 9- 

' u 1 1 1 1 1| M II Mill MM MMMM Ml i IIMMIMM II M 

733 ATTATOACAAAATACATCAGTAAACCTGAGAACCTCAAATTAATGATGAACCTGC 792- 
730 GATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTTGTGGCCAGT 84?, 

M MUM | 1 1| II II II 1 1 1 II I MMMM II III III III IIHII - LIU I 

793 GACAAAAGTCGCAACATCCAG^ 852 
, ,0 CCTCACAAAACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAACTCATTGAG 909 

" Ml MM I! MUM I I II MM! II I II II I II II 1 1 1 1 1 1 1 1 1 , 

8S3 . CCTAACAAGACGCAGCCCATCCTAGACATCCTCCTCAAGAACCAGGCCAAACTCATAGAG 912^ . 
7 1 0 TTTCTGAGCAGCTTCCAAAAAGAAAGGACGGATGATGAGCAGTTCGCTGACGAGAAGAAC 969: ■ 

v; " M || Mil || M II II II I II 1 1! I M II II I II I IIMMIMM 

913 ^CCTCAGCAAGTTTCAGAACGACAGGACGGAGGATGAGCAGTTTAACGACGAGAAGACC 972 
77') TACTTGATTAAACAGATCCGAGAGTTGAAGAAAACGGCCC 100 9 

1 1 j | M II II M I II I II 1 1 1 1 1 1 1 I I II I 

973 TATTTAGTTAAACAGATCAGGGATTTGAAGAGACCAGCTC 1012 



RESULT 15 

AF113536 

LOCUS 



AF113536 



3466 bp mRNA linear ?RI 04 -DEC- 1999 



DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 



FEATURES 

source 



Homo sapiens M025 protein mRNA, complete cds . 
AF113536 

AF113536.1 GI: 6523826 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 3466) 

Jin # W. f Shi , J. , Ren,S., Gu,J., Fu,S., Huang ,0-, Dong,H., Yu,Y. , 

Fu,G., Wang,Y., Chen,Z. and Han,Z. 

A novel gene expressed in the human hypothalamus 

Unpublished 

2 (bases 1 to 3466) ■ 
jin |Ww Shi , J . , Ren,S., Gu,J., Fu,S., Huang., Q. , Dong,H, , \u,Y., 
Fu,G., Wang,Y. f Chen,Z. and Han>Z. 

Direct Submission ^ ^ , 

Submitted (16-DEC-1998) Chinese National Human Genome Center at 
Shanghai, 351 Guo Shoujing Rd. , Zhang jiang Hi -Tech Park, ,Pudong, 
Shanghai 201203, China 

Location/Qualifiers 
1. .3466 

/organism="Homo sapiens" 
/mol_type-"mRNA" 
/ db_ xref = "t axon :960b 1 ' 

/tissue_type.= ,, hypothalamus M _ 
54. .1079 

/ codon_s tart = 1 " . " ■ - 
/product="M02 5 protein" 
/ P rotein_id="AAF14 87 3.1" 
/db xref="GI: 6523827" 

/ trans 1 at i on-"NPFPFGKSHKSPADIVKNLKESMAVLEKQDISDKKAEKATEEVS 
KNLVAMKEILYGTNEKEPQTSAVAQLAQELYNSGLLSTLVADLQLIDFEGKKDVAQIF 
MNILRRQIGTRTPTVEYICTQQNILFMLLKGYESPEIALNCGIMLRECIRHEPLAKII 
LWSEQFYDF-FRYVEMSTFDIASDAFATFKDLL.TRHKLLSAEFLEQHYDRFFSEYEKLL 
HSENYVTKRQSLKLLGELLLDRHNFTIMTKYISKPENLKLMMNLLRDKSRNIQFEAFH 
VFKVFVANPNKTQP I LD I LLKNQAKL I EFLS KFQNDRTEDEQFNDEKTYLVKQ IRDLK 



BASE COUT'.'.' 



RPAQQEA" 
1101 a 606 c 639 g 



1070 t 



Qaerv Match 57.4%; Score 581.6; DB 9 

Best-" Similarity 74.7%; Pred . No. 7.6e-132 

Matched 7^7; Conservative 0; Mismatches 244 



Length 3466; 
Indels 9; Gaps 



l; 



Qy 

Db 

QY 
Db 

Qy 

Db 



in TTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATCCTGAAAGACAATTTC 7 8 

"""HI I II M II 1 1 1 1 Til I III 1 1 I MM II I I I MM M 1 MM 10C 

66 4ttggaaagtctcacaaa^ 125 

73 attttggaaaagcaagac— aaaaagacagacaaggcttcaqaa 129 

II II III MM Mill 1 1 Ml I ill MM II 1 1 1 1 1 1 1 1 1 1 II 

126 GTTCTC^ 185 
130 AAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAACCCCCAACA 189 

ill Ml || | i 1 1 1 1 1 1 M I i M 1 1 Ml MMI II Mill II I III 

i-e aajLu^tctggttgccatg 245 



1 90 GAAGCA.C3TGGCTC AG CT A.C5 CA.C T CT.AC A.GTGGC CTGC^TJVGT^ AC^^C^TGATA 249 

310 ATCTTGAGAAGACAGATAGGCAGTCGGAGTC^ 369 
370 GA.T ATC GTG*rTTA.TGCiTC CT ^"*^^'^'^^'?* , ^^?^^^?^ 429 

I I I I I I I I I I I I I ' ' ' ' ' ' ' ' 

i2S AATATTTTCTTCATC 435 
430 ATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAAT^ ^ 

i i i i I I I i II II 1 1 1 1 MINN I ! II I 

486 ATAATGTTAAGAGAAT ^ 
,90 CAATTCAGAGATTTCTTTAAGTACGTGGAGTTG^^ ^ 

1 1 1 1 1 1 1 1 1 1 1 1 I II III M Mill II 1 1 1 1 

3*6 cagttttatgattI'cttcagatatgtcgaaatgtg 60S. 

5 TTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGACTTCTTAGAA 60-9.. 

IT 1 1 ii 1 1 1 1 1 1 1 1 i 1 1 1 i I II Mill 1 M Mill 1 1 II IN 

,0, |-XTGCCACATTCAAGGATTTACT^ 6 65 " 

,-0 CAAAAXIACGACACTATTTTTGAAGACTATG^ * & * . 

; i I I I I I ! r 1 I II I I I M 1 1 1 M I I I 

0,, CAGcJvTTATGATAGATTTTTCA.3TO 

ACTAAGAGACAGTCTTTAAAGCTGC^^ 729 

; iiTi 1 1 1 1 1 ii i i 1 1 1 i > 1 1 1 1 I 1 1 1 1 1 1 1 1' I 

acaa^agacagtcactgIagct 785 ' - 

730 ATCATGACAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATC 7S9. 

i i i i i i i i I I II Mill I I I I M I I I M M 

78S attatgacaaaatacatcagtaaacot 845 

790 GATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTT B49 

i i i : 1 1 1 1 1 1 1 1 1 1 M I M 1 1 MM II I II 1 1 1 M II M 1 1 1 1 l 

S« GACAAAAGTCGCAACATCC^ 

Bsc cctcacaaaacac::agcctattgtggagatcctgttaaaaaat^ 909 
iii imi it Mill II IN III I M II M I M 111 

906. ^UgAAGA^ 965 
9X0 TTTCTGAGCAGCTTCCAAAAAGAAAGGACGGATGATGAG^ ™ 

, M .iUUiU^ 1025 

97 0 TACTTGATTAAACAGATCCGAGACTTGAAGAAAACGGCCC 100 9 

' Mil I II I I I M I I I I I I I M II I I I I I I I 

1-26 TATTTAGTTAAACAGATCAGGGATTTGAAGAGACCAGGTC 1065 



Search completed: January 6, 2004, 02:34:57 
Job time : 3 9^1 sees 



GenCore version 
Copyright (c) 1993 - 2004 



5.1.6 

Compugen Ltd . 



CM nucieic 
Run on:. 



nucleic search, using sw model 



January 



2004, 00:31:27 



; Search time 390 Seconds 

(without alignments) 

7018.539 Million cell updates/sec 



Title : 

Perfect score: 
Sequence^ : 



US-10-088-872-1 
1014 

1 atgaaaaaaatgcctttgtt tgaagaaaacggccccttga- 1014 



Scoring table : IDENTITY__NUC 

Gapop 10.0 , Gapext 1.0. 

Searched: 2552756 seqs, 1349719017 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Vost -;proc-;i3sin : : ' : Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



5105512 



;a : oe 



6 
7 
8 
9 

10: 
11 : 
. 12 : 

13 : 

14 : 

15 : 

16 : 

17 : 

18 : 

19 : 

20 : 
21: 

22 : 

23 : 

24 : 

25 : 

Precl. No. is 



N_Geneseq_19Jun03 : * 

1:" /SIDSl/gcgdata/geneseq/geneseqn--enibl/NA1980.DAT:* 
2 : /SIDSl/gcgdata/ geneseq/ geneseqn-embl/NA1981 . DAT : * 
3: /SIDSl/gcgdata/ geneseq/ geneseqn-embl/NA1982 .DAT : '* 
4 : /SIDSl/gcgdata/ geneseq/ geneseqn-embl/NAl 9 83 .DAT : * 
/3IDSl/gcgdata/geneseq/geneseqn-embl/NA1984 .DAT: * 
/SIDSl/gcgdata/ geneseq/ geneseqn-embl/NA1985 . DAT : * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1986 . DAT : * 
/SIDSl/gcgdata/ geneseq/ geneseqn-embl/NAl 987 . DAT : * 
/SIDSl/gcgdata/ geneseq/ geneseqn-embl/NA1988 . DAT : * 
/SIDSl/ gcgdata/ geneseq/ geneseqn-embl/NAl 9 8 9 . DAT : * 
/SIDSl/gcgdata/ gene seq/ geneseqn-embl/NA1990 . DAT : * 
/SIDSl/gcgdata/ geneseq/ geneseqn-embl/NA1991 . DAT : * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1992 .DAT:* 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1993 .DAT:* 
/SIDSl/gcgdata/ geneseq/geneseqn-embl/NA1994 . DAT : * 
/SIDSl/gcgdata/ geneseq/ geneseqn-embl/NA1995 . DAT : * 
/SIDSl/gcgdata/geneseq/ geneseqn-embl/NA1996 . DAT : * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1997 .DAT:* 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1998 .DAT: * 
/SIDSl/gcgdata/ geneseq/ geneseqn-embl/NA1999 . DAT : * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA2000 .DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA2001A.DAT:^ 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA2 0 01B.DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA2002 .DAT:* 
/3IDSl/gcgdata/geneseq/geneseqn-embl/NA2003 .DAT: * 



the number of results predicted by chance to have a 



sco— areater than or equal to the score of the result being printed, 
and ^derived by analysis of the total score distribution. 



SUMMARIES 



% 

Result Query 

,Ko. Score Match Length DB 



ID 



Description 





1 


1014 


100 . 0 


1 U 14 


9 9 
Z Z 


a AFR64 62 




2 


1014 


100.9 


14 2 J. 


9 9 
Z Z 


AATSR7 34 




3 


1010 . 8 


99.7 


1 'J A A 
1>544 


9 1 

Z -L 


AAA273 32 




4 


9 92 . 8 


97 . 9 


/UUZ 


9 9 
z z 


AAH1 S879 




5 


770 . 6 


76.0 


o o o 
OZZ 


7 9 

z z 


a awn ^4. 7 1 




6 


684 . 6 


67 . 5 




9 n 
z u 


a AX 3 9R 1 7 


c 


*■/ 


634 .4 


67.5 


iiy i 


7 9 

z z 


a. a i £ n 0 9 0 

MJA X D U v Z W 


8 


582 . 6 


57.5 


lUzb 


7 9 

z z 


aaPQI 772 




9 


582 . 6 


57.5 


o Z O X 


9 4. 
z *± 


ABK13 127 




10 


582 . 6 


57 . 5 


O Q A Q 


9 ^ 

Z J> 


ARV2 2 987 




11 


5 82 .6 


57 .5 


"3 Q A Q 


7 1 

Z J) 


ARV2 8822 




12 


541 . 6 


53.4 


*1 AC') 

1U j j 


2 2 


AAF3 06 8 8 




13 


539.6 


53.2 


1 1 £9 


2 3 


AAS 8 95 5 7 


c 


14 


520 , 2 


IT 1 T 

51.3 


O J -5 


9 0 
z u 


A AX 3 9818 




15 


496 


4 8.9 


O A Q 9 


9 1 

Z J> 


AAS8803 1 




16 


3 87.8 


3 O O 

jo.z 


7 9 9 
/ Z Z 


2 0 


AAZ15133 




~ 7 


362 . 3 


3 5.8 




2 3 


ARL07 15 1 




13 


362 . 3 


3 5.8 


A ") 7 1 


9 

Z J 


ART, 07150 




19 


2 8 3 - 8 


2 8.5 


o y u 


9 A 
Z *± 


arc7 7 o 84 




20 


£46 . 4 


24 . 3 


/IOC 


9 A 
Z 4 


adtq 7 9 R S 




21 


■ : 14 . 8 


24 . 1 


/I /I "7 

44 / 


9 A 
Z 4 


ART.R9 ^9 1 




■22 


; >14 . 3 


24 . 1 


A C A 
4 D U 


9 A 
z *± 


ART.R 1 97 5 




A? 


ill 0.3 


2 0.8 




7 A 

.4*1 


AR e ?7 6 7 84 




-4 


210 .4 


20.7 


T A 7 A 
.14 / 4 


9 1 
z X 


AAP3 2 983 




' 2 5 


2 0 8.3 


20.6 


1 /I Q 7 

14 y / 


9 1 

Z .L 


a AP4 0181 




c 


20 0 . 2 


19.7 


y i o 


9 1 

Z X 


AAC4 2 766 




27 


: •. 0 0 . 2 


19.7 


lUJz 


9 1 


a AP4- R9 S 3 


r; 


28 


195 


19 . 2 


TOT 

3 o / 


"> A 
Z 4 


MX51M ?J^OJ 


c 


29 




1 Q 9 

X 27 . Z 


387 


24 


ABL66143 




30 


16 9. 3 


16.7 


722 


24 


AAS61992 




3 1 


166.6 


16 .4 


700 


24 


AAS61993 




32 


163 .8 


16 .2 


481 


25 


ABZ19574 




33 


163 .4 


16 .1 


300 


20 


AAZ14552" 




34 


161.2 


15 . 9 


1515 


21 


AAC5.0415 




3 5 


156 


15 .4 


861 


24 


ABN98824 


C 


36 


153.4 


15.1 


73.7 


23 


AAS7 944,9 




37 


147.2 


14 .5 


464 


21 


AAC46721 




38 


133 .2 


13 .1 


615 


22 


AAH07116 




39 


107 . 6 


10 .6 


1149 


23 


AAS88030 




40 


107.6 


10.6 


3279 


23 


AAS89559 




41 


65 . 6 


6.5 


432 


24 


ABN7 8107 




42 


65 


6 . 4 


487 


22 


AAI98879 




43 

44 


65 


6.4 


487 


22 


AAI64066 




53 .6 


5.3 


254 


25 


ABX31310 




45 


43 


4.2 


447 


21 


AAC06449 



Human Acute Neuron 
Human polynucleoti 
Human calcium bind 
Human cDNA sequenc 
Human cDNA clone ( 
Gastric cancer ass 
Human polynucleoti 
Human ANIC-BP (acu 
Human secretory po 
Human prostate exp 
Human prostate exp 
Human acute neuron 
DNA encoding novel 
Gastric cancer ass 
DNA encoding novel 
Human gene express 
Drosophila melanog 
Drosophila melanog. 
Frog embryonic gen 
Human ovarian ca.ac 
Human ovarian cane, 
Human ovarian cane 
Frog embryonic gen 
Arabidopsis thalia 
Arabidopsis thalia 
Arabidopsis- thalia 
Arabidopsis. thalia 
Gene #481 used to 
Lung cancer relate 
Porcine muscular, s 
Porcine muscular s 
Group III cDNA can 
Human gene express 
Arabidopsis thalia 
Arabidopsis- thalia 
DNA encoding novel 
Zea mays DNA fragm 
Human cDNA clone ( 
DNA encoding novel 
DNA encoding novel 
Human ORF3 0 54 cDNA 
Human excretory re 
Human bladder rela 
Human GDP-mannose 
Human secreted pro 



ALIGNMENTS 



RESULT 1 
AAF86462 

ID AAF86L462 standard; cDNA; 1014 BP. 

XX 

AC AAF86462; 
XX 

DT 2.6-ITOK-20O1. (first entry) 
XX 
DE 
XX 
KN 
KW 

kw 

XX 

OS Homo sapiens . 



• V 



Human Acute Neuronal Induced Calcium Binding Protein, ANIC-BP, cDNA. 

Buna- cerebroprotective; neuroprotective; vulnerary; vaccine,- 
Sn" --rapy; Acute Neuronal Induced Calcium Binding Protein; ANIC-BP; 
"t^-:'"'acut; head trauma; multiple sclerosis; spinal core injury ; bb. 



XX 
FH 

FT CDS 
FT 
FT 
FT 



Kev Location/Qualifiers 
1. .1014 

/product^ "Human Acute Neuronal Induced Calcium Binding 
Protein, ANIC-BP" 



y?l WO200123552-A1. 
T-'j 0 5-; PR -2001 . 



:.-3-?iwp--20 00 ; 2 0 00WO-EP0 9132 . 

. : , A ^ ;E p. 1999; 99EP-0H8848 . 

YM ' • • • - ' 

• A ' ) MERCK PATENT GMBH. 

XX ' 

PT \ I;aas 3C , Duecker K; 

DR vJP'i; 2001-308142/32. 
P-PSDB; AAB82090. 



"ct-1 human acute neuronal induced calcium binding polypeptide,- and -. : 
^^nucleotides encoding them useful for diagnosi ng - treating- stroke., 
'acute head trauma, multiple sclerosis and spinal cord injury 

Giai-m 5; Page 40-41; 45pp; English. 

nresent sequence is the coding sequence for human Acute Neuronal.. 
Sauced Calcium Binding Protein (ANIC-BP) . ANIC-BP coding sequence and 
ivSrin are useful for treating stroke, acute head trauma, multiple 
sc-rosis and spinal cord injury. ANIC-BP coding sequence and protein.- 
^ uefu as vaccines for inducing an immunological response in 



PT 
PT 
XX 
PS 
XX 
CC 
CC 
CC 
CC 
CC 

CC m=-ini.v.:i'l ■ 
XX 
SQ 



Sequence 1014 BP: 340 A; 205 C; 209 G; 260 T; 0 other; 

Query Match 100-0%; Score 1014; DB 22; Length 1014; 

Best Local Similarity 100.0%; Pred . No 3 . 5e-272 ; 

Matches 1014; Conservative 0; Mismatches 0; Indels 0, oaps 



Qy 

Db 



Db 

Qy - ; ' 



1 ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAA^ 60 

~ ii II II Mill I III II Mllll Ml II 1 1 M II M II 1 1 1 II I II I II I II 1 1 II 

, ATGAAAAA^TGCCTTTGTT^ 60 

-1 CTGAAAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 12 0 
Qy ol CTGAAAGACAATT ( j ( ! ( j ( , , , , , , 

Db 61 CTCAAAGACAATT'TCKS^^ 120 

19- -AAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA 180 
Qy 12, .AAGTGTCTAAATC \C | | | | | | | || I I I II I I M II I I I I M II I I M I II III I 

Db 121 GAAGTGTCTAAATCACTG^ 180 

-CCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 240 . 
Qy .8 -^CCAACAGAAGCA ( ( ( ( j ( ^ { j , ( , ( ., , , , , 

Db ' 181 CCCCCAACAG^GCAGTGGCT 240 

,V XTTAACAACATC'TTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTA(3T 360;. 

IT mm 1 1 1 mi 1 1 1 1 1 1 M II M 1 1 1 M 1 1 II I II 1 1 1 II Ml I M I II 1 1 1 1 1 1 

, nrrr . ATrCTrATAT CCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 4 20 

Qy 7m i mm Ml I mi 1 1 1 1 1 1 II I II ill II 1 1 1 II I Ml 1 1 M 1 1 1 1 M I M I II 1 1 1 _ 

:r> ycTCATCCTCATATCCT ' 

.- , GTTGTGGGAT'i , ATGCT(3AGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTC 430.," 
Qy GTTGTGGGAT A1G , . . . , , , , , , , , , , , , | | , | , | M | | II II M I I I II II I I I 

^ ' ^ CGTTGTGGGATTATGCTC ^ 

-TT-CTAATCAATTCAGAGATTTCTTTAAGTAGGTGGAGTTGTCAACArrrGATATTC S40" : 
M ll^lltll M I I II I II II | II I II M II I I M I M I M I M I I M M I II I I M I M 
.I TTTTCTAATCAATTCAGA bW 

Qy 5-1 TCAGATGCCTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGG 600.. 

Y 1 1 1 1 1 1 1 1 I I II 1 1 M II I II I II I II 1 1 I II Ml II II II I M II I 

-01 TTCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAG 66{1 

oy ooi M M 1 1 1 1 II I II MM II ! I M II 1 1 1 1 1 1 1 1 1 II II 1 1 1 II 1 1 

Db SOI ijifCTTAGAACAAAATTACGAC 660 

AATTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCC^ 720 

Q/ aatta , | , , 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m i ii i ii ii 1 1 mm mm'' mi i 



Qy 
Db 



i I I I I I II I I II II II M II II I M M M M M M M I ! I I M M 

7.1 AACTTTGCCATCATGACAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAAC 780 

Oy 721 ^TTTGCCATC , , M || II M II I 1 1 1 1 1 M M ! M II I M M I M I II II 

Db 721 tIaCTTTGCCATCATG^ 780 

Qv 781 CTGCTTGGGGATAAAAGTCCGAACATCCAGTT^ 840 

i I M II II II M M II 1 1 II II M I II I M 1 1 1 II M M II I 

781 CTCCTTCGGGATAAAA^ 840 
Qy 341 GTGGCCAGTCCTCACAAAACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAA 900 



-■■■!■. 



'ft 



iillllllllllMIIIIIIIMIirilMMIIIIIIIiMIIIIIIIIMIMISIII 

cm CTCATTGAGTTTCTGAGCAGCTTCCAAAAAGAAAGGACGGATGATGAGCAGTTCGCTGAC 960 
Qy 901 ^ATTGAGTTTC ,,,,,,,,,,,,,,,,,, | , , | , | | | | | | | | | | | | | | | | | | | | | | 

961 GAGAAGAACTACTTGATTAAACAGATCCGAGA^ "14 

96i UgIU^^ 1014 



RESULT 2 
AAI5823 4 

ID AAI53234 standard; cDNA; 1421 BP. 

XX 

AC AAI58234; 
XX 

DT 22-OCT- 1?001 (first entry) 



Human polynucleotide SEQ ID NO 437. 
KW Human 



XX 
DE 

S -.man: nootropic; immunosuppressant; cytostatic; gene therapy; cancer; 
J! ^ripheral nervous system; neuropathy; central nervous system .NS 
:4 - V'Simer ■ * ; Parkinson ■ s disease ; Huntington • s disease ; naemosta.rc, . 
'-" ! I ." la .„. 1 o-ierosis- Shy-Drager Syndrome; chemotact.ro; 
SS^^S^SS^ lcr^ im , arthritis; i n f U-»tio„, ; . 
leukaemia; ss . 



77J 



OS ?&yc<yj sapiens . 
XX. 

pi-t v :o : ; . 0 U "j. 5 3 3 1 2 - A 1 . 



9r:> '2 6 - 01 JL -2001. . . . 

XX - 

PF 2.O-D3C-2000; 2 0 00WO-US342 63 , 
XX 

PR 21-IAN-2000; 2000US-0468725 . 

?!-< 25- APR-2000; 2000US- 0552317 . 

pr? 09-JUL-2000; 2000QS-0598042 . 

PR 19-JUL-2000; 2000US-.0620312 - 

PR 03-AUG-2000; 2000US-0653450 . 

PR - A-SEP -2000; 2000US- 0662191. 

PR 19-OCT-2000; 2000US-0693 036 . 

PR 2 9 -NOV.- 2 000; 2000US-0727244 . 

yv ' , ' 

FA (HYSE-) HYSEQ INC. 
XX 
PI 
PI 

?I 

XX 

DR WPI; 2001-442253/47 
DR P-PSDB; AAM39078. 
XX 
PT 
PT 



■vni^a- / «^ — ^ 

Tanc v T L iu C, Asundi V, Chen R, Ma Y, Qian XB, Ren F, Wang D; 

Wancr J Wang Z, Wehrman T , Xu C, Xue AJ, Yang Y, Zhang J; ■ 

Zhao QA, Zhou P, Goodrich R, Drmanac RT; 



Novel nucleic acids and polypeptides, useful for treating disorders 
such as central nervous system injur res - 



XX 

PS Claim 1; SEQ ID NO 437; 10078pp; English. 

XX 

CC The invention relates to human nucleic acids (AAI57798 -AAI61369) and 

CC the encoded polypeptides (AAM38642 -AAM42213) with nootropic, 

CC immunosuppressant and cytostatic activity. The polynucleotides are useful 

CC in gene therapy. A composition containing a polypeptide or polynucleotide 

CC of the invention may be used to treat diseases of the peripheral nervous 

CC system, such as peripheral nervous injuries, peripheral neuropathy and 

CC localised neuropathies and central nervous system diseases, such as 

CC Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 

CC lateral sclerosis, and Shy-Drager Syndrome. Other uses include the 

CC r^ilisation of the activities such as: Immune system suppression, 

CC Activiu/inhibin activity, chemotactic/chemokinetic activity, haemostatic 

CC and thrombolytic activity, cancer diagnosis and therapy, drug screening, 

CC assays for receptor activity, arthritis and inflammation, leukaemias and 

CC C.N.S disorders. 

CC Note: The sequence data for this patent did not form part of the printed 
CC :;p e o if i c a t i on . ■ . 

SQ Sequence 1421 BP; 469 A; 284 C; 306 G; 362 T; 0 Other ; 

Query Match 100.0%; Score 1014; DB 22; Length 1421; 

Best/uocal Similarity 100.0%; Pred. No. 4e-272; 

Ma«:ches 1014; Conservative 0; Mismatches 0; Indels 0; Gaps..- .0; 



Oyi 
•;.b 



■ 7 tgaaaaaaatgcctttgtttagtaaatcacagaaaaatc>gagcagaaattgtgaaaatc 60 
; MM II II I I I I II I II I III M I l-HI I I I 1 1 MM Mill I Ml I II III IN ill'i " 

-17 aTGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATC 2 76 
- ' " '"'TGAAAGACAATTTGGCCATTTI'GGAAAAGCAAGACAAAAAGACAGACAAGGCT.TCAGAA 1 2 0- 

iiiiillllilllMIIIIIMMIIIMIiMIIIMIIIMIIMIIIMIIIIMII 

27 7 CTGAAAGACAATTTGGCCATttTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 3 3 6 



121 GAAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTG.TGTGGTACAAACGAGAAAGAA 18 0 

" 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II I M 1 1 II 1 1 1 1 1 1 1 

Db 337 GAAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA 



396 



Qy 



456 



HI f^cCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG. 24 0 

' T I M 1 1 1 1 M 1 1 1 ! 1 1 1 1 1 1 1 1 i 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 i 1 1 i M I E ! I 

Db 3 97 CCCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 

, 41 ACACTGATA.GCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATA 3 0 0' 

Q/ * * MiiiiiillllllllMIIIIIIIIIIIIIINIIIIIMUIIII II III MINIM ; 

Db 4B7 ACACTGATAGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATA 516 

r , T , 01 TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGT 360 

1 " I | ) | | | | | I | | | | | | | | I ! I I I I H I I I M I I I I I I I I I I I I I I M 1 1 1 1 1 1 1 1 1 i 1 1 1 > e 

Db 517 TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGT 57 6 

GCTCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 42 0 

gctcatcc 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 II It M 1 1 i Ml Ml 11 

Db 577- GCTCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCGCACAGATTGCCTTA 63 6 

r .. 4 ,i CGTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTC 480 

~ I M 1 1 1 1 1 M I M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 M M 1 1 1 M M II II I Ml ^ 

Db 637 CGTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCC1C 696 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 



Qy 

Db 



TTTTCTAATCAATTCAGAGATTTCTTTAAG^^ 
TTTTCTAATCAA^ 

TCAGATGCCTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGAC 

Ty 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 i 1 1 1 1 1 Mill I II in II in I 

TCAGATGCGTTTGCTACTTT^ 

TTCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAG 
TTCTTAGAACAAAATTACGACA^ 

AATTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACC^ 

till II I II II 111 Ml Ml Ml Mill 1 1 Mill 1 1 II MUM III Ml 1 1 

AACTTTGCCATCATGACAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAAC 
^CTTTGCCATCAT (| ,,,, , , | | | | | | | M I I I I II II M II I I I I I I 

AACT'ITGCCAT^TGACAAAGTA^ 

CTCCTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAA 

1 1 1 1 1 1 1 1| 1 1 1 1 1 II 1 1 1 M I II M II II II II 1 1 1 1 1 1 1 1 M II II I II II I Ml 1 1 

GTGGCCAGTCCTCACAAAACACAGCCTATTGTC 

1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 M M 1 1 1 1 1 M M ! 1 1 M I h I M ! 

0 7 JTGGCCAGTCCTCAC 

• • • - '^ rr ^ATTGAGTTTCTGAGCAGCTTCCAAAAAGAAAGGACGGATGATGAGCAGTTCGCTGAC 

' iTmltMl M I i I Ml II M Ml I Ml 1 1 M II II I II M II I M M I II 1 1 M III 

, 117 ctcaTTGAGTTTCTGAGCAG^ 
961 GAGAAGAACTACTTGATTAAACAGATCCGAGACTTGAAGAAAACGGCCGGTTGA 1014 

TTTTTTvT 1 1 1 II 1 1 1 1 1 II 1 1 S 1 1 1 1 1 1 1! 1 ! M 1 1 II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 _ 

II-/ GAG^GAACTACTTGATTA^CA 1230 



4 8*1 
697 
■S'-ll 
7 57 
601 
817 
6f>] 
87 7 
721 
937 
7&1 
J 9.7 

1 



540 
756 
600 
816 
660 
876 
720 . 
936 
780 
996 
840 
1056 
900 
1116 
960' , 
1176 



RESULT 3 
AAA27332 

ID AAA27332 standard; cDNA; 1344 BP. 

XX 

X):r ,^0-iNUG;-2000 (first entry) 

XX , , 

■35 Hvv^r-n calcium binding protein hLBP gene. 

XX 
KW 
KW 
KW 
XX 
OS 
XX 
FH 
FT 
FT 



Huran- calcium binding protein; cancer; inflammation; CBP; 
VeS^uctive disorder; autoimmune disorder; developmental disorder; 
seizure disorder; immune disorder; infection; ss. 



HcviiO sapiens. 



Key 
CDS 



Location/Qualifiers 
124 . . 1134 
/*tag= a 



/product^ "calcium binding protein" 



FT 
XX 

PN WO20C029580-A1. 
XX 

PD 25-MAY-2000. 
XX 

PF 12-NOV-1999; 99WO . XJS27027 . 
XX 

PR 13-NOV-1998; 98US-0190965 . 
XX 

PA (INCY-) INCYTE PHARM INC. 
XX 

PI Tang YT, Guegler KJ, Cor ley NC, GorgoneGA; 
XX 

DR WPI; 2000-387793/33. 

DR P-PSDB; AAY94 24 7 . 



?T or reproductive disorders 
XX 

PS Claim 9; Fig 1; 72pp; English. 

XX 
CC 

cc 

CC 

cc 



cc 

XX 
SQ 



• ,-.4lA > , 

3\ 



rep^oductivf disorders (such as infertility, ovulatory defects * • 

^ometriosis disruptions: of the oestrus and menstrual cycles,: 

" lr syndrome and ovarian hyperstimulation) , auto,— 

liSders (such as benign prostatic hyperplasia and prostatitis) ■ V 

ev'eioomental disorders" (such as Gushing ' s syndrome \,™^«^^ ^ 
CC and gonadal dysgenesis), hereditary neuropathies, eexzure dxsord-rs , 
CC immune disorders (such as AIDS, allergies anaemia, 
CC atherosclerosis, cholecystitis, Crohn's disease di - r aves 

CC disease, multiple sclerosis, psoriasis, rheumatoid arthr itis 

Scleroderma Siogren- s syndrome and ulcerative colitis) , and viral, 
bacteriarfungal parasitic, protozoal and helminthic infections. ■ 

Sequence 1344 BP; 450 A; 261 C; 280 G; 353 T; 0 other; 
ouer- Match 99.7%; Score 1010.8; DB 21; Length 1344; 



Qy 
Db 



i A^naaaaaaATGCCTTTGTTTAGTrv^ . — — - ,,,,,,,,, 

tTTTTTTTTTTTTT i 111111111111111111111111111111111111111111111 

■ t 24 ATGAAAAAAATGCCT 

' c- rTCAAAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 
Qy bx CTGAAAGACAATTTG ,,,,,,,,,,, ,,,,,, | ,,,,, | ,, | | | | | | | | | | | | | | | | | | 

Db 184 CTGAAAGACAATTTGGCCA 

r a a pT-nxCTAAATC ACTGC AAGCAATGAAAGAAATTCTGTGTGGT AC AAACG AGAAAGAA 

TTTTTTTTmTm? 1 1 1 1 1 1 1 1 1 1 1 1 H 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 " M 1 1 1 M 1 1 1 1 1 1 M 



AAATCACACAAAAATCCAGCAGAAATTGTGAAAATC 



60 

183 

120 

243 

180. 



244 GAAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA 303 
- v, CCCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 24 O 

mM^millllllllllMIIIMMIIIIINIIMMIIIIMIIIIIII III 

304 CCCCCGACAGAAGCAGTGG^ 363 
2-t! O.CACTGATAGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATA. 3 00 

tiMllllMIIIIIIIIMIMIIIIIIIIMMIIIMIIillMIIIIIIUIIII.I 

364- ACACTGATAGCTGACCTGC^ 423 
3 0 1 TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGT 3 6 0 

424 TTTAACAACATCTTGAGi^ 483 
-«i rr'rraTCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 420 

I! M Mill Ml Ml IMMMMMMilMMIMMIIIIMIMMIIIIIIII 

484 GCTciicCTC^ 543 
, n ,'^GTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATC 480 
" Til! I I I I I I I I i | | | | | | | | | | | | | ! | | | I I I I I I I I I ! M II ! I I I I I U I Mj I I 1 
544 CGTTGTGGGATTATGCTC 603 
• 81 TTTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCT 540 
- 81 ™TAATCAA | ! u M I I I II! I I I M ! 1 1 1 11 1 1 1 1 M I I I 

,04 ^Uci^ 663 
, ■ ; ''CAGATGCCTTTGCTACTTTCAAGGArTTACTAACCAGACATAAAGTGTTGGTAGCAGAC 600- 

' : mi 1 1 m 1 1 1 1 1 1 1 M I II II II II I M II II 1 1 M I! M I M I i M I M M I M I Ml I 

TCAGATGCOTTTG^ ' ' 

TTCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCT^ 66:0 i 

783 '• 

.,. , aTTaTrTTACTA AGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCAC 72:0, 

III I MM I Ml II MM I II 1 1 II II I II Mill I II II 1 1 1 i M II 1 1 II I M Mill 

734 UUItcUac!^ 843 
AACTTTGCCATCATGACAAAGTATATCA.GCAAGCCGGAGAACCTGAAACTC 780: 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M- 1 II II I II II II II I II I II II II 1 1 II I M I II II 1 1 

m AACTTTGCCATCATGACAA^ 903 
nql rTCCTTC GGGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTT 840 

IM>MI illlMMMIMIMIIMMIMMIMIIMIIMIMIMIIMIIM 

:,04 CTCCTTCGGGATAAAA 963 
GTGGCCAGTCCTCACAAAACACAGCCtATTGTGGAGATCCTGTTAAAAAATCAGCCCAAA 900:- 

" YTYTTT^TTT 1 1 1 1 M ii i ii 1 1 ii 1 1 1 m 1 1 1 1 1 ii 1 1 1 1 ii m i m 1 1 1 1 1 1. 1 1 ii ii 

964 GTGGCCAGTCCTCACAAAAC^ 1023 
0 ■ > 1 CTCATTGAGTTTCTGAGCAGCTTCCAAAAAGAAAGGACGGATGATGAGCAGTTCGCTGAC 360 
TT^tTTl ! I I I I | | I I I I I I I I I I I I I I I I I M I I II I I I I I I I I I I I Ml I I ! M I I II 
1,24 CTCAl^GAGTTTCTGAGCA 1083 
SflV GAGAAGAACTACTTGATTAAACAGATCCGAGACTTGAAGAAAACGGCCCCTTGA 1014 

II 1 1| | ii 1 1 MIM Mill MM I Ml II Mill II MIMI Mill Ml III 

1084 GAGAAGAACTACTTGATT 1«7 



RESULT 4 

AAH15879 " ^ 

ID AAH15.-879 standard; cDNA; 2002 BP. 

XX 

AC AAH15879; 

XX . 

DT 26-JUN.-2001 (first entry) 

XX 

DE Human. cDNA sequence SEQ ID NO : 144 07 . 

KW Human., primer; detection; diagnosis; antisense therapy; gene therapy; ss, 

XX 

OS Homo sapiens. 
XX 

PN EF1074 617-A2 . 

XX • 

PD --.W -?EB-- 2.0-0 1\: • < 

XX - • 

PF 2i- JUL --2.060; 2000EP-0116126 . 

XX • , - .- . 

t:- '-)-JUL- ;-9 39; 99 JP- 024 803 6 . 

:;7-AUG-1999; 99 JP- 03 0 02 53 . 

III - .iAN-2000; 2000 JP - 0118776. - 

PR ; - MAY - ? 0 0 0 ; 2000 JP-0183767 . 

PR -»9--JUN 2000, 2000JP-0241899. 

UiHVjX- ) SKLIX RES INST. ■ . > 

r ',.'..,:>, T . og ai T, Nishikawa T. Hayashi K, Saito K, Yamamoto. J ; , J 
ghii's,'"sugiyama T, Wakamatsu A, Nagai K, Otsukiw , 

>IPt; 2001-318749/34. 

KX ■ ^ c for synthesizing polynucleotides, particularly the- 5602 

yr Primer sets tor _ synthesizing ^/^^ . f icationf and for the detection 

PT 

PT 

PT full-length cDNAs 



Primer sets for synthesizing poiynuoic^^a, det ect 
full-length cDNAs defined in the specification, and for the detect 
ana/or oLgnosis of the abnormality of the proteins encoded by th. 



XX 
PS 

XX 



cc 
cc 

Cyl 
CC 

cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 



Claim 8; SEQ ID 14407; 2537pp + CD ROM; English. 

^o~viVv=»c. nr imer sets for ■ synthesi sing 5602 

.comprises, (a) an ongo f nolvnucleotide which comprises one of 

to f he complementary strand of a polynucleosis where the 

of an oliaonucleotide comprising a sequence complementary to the 

? Snrarv strand of a polynucleotide which comprises a 5 '-end 
SST^ SSSo^l-StlS comprising a sequence complementary to a 
Polynucleotide which comprises . ; 3 ■ -end -jquence. where the ^ ^ 

oligonucleotide «^ns^a ^ ; Selected from those defined in 
the - 5 '-end .-quence/3. "^^f^ be used/ in ant i se nse therapy and 
the specif ication T The prime for synthesis ing polynucleotides 

particularly full length cDNAs . The primers are also useful for the 



cc 
cc 



detection and/or diagnosis of the abnormality of the P»«J™ " Y 
tha Euli-langth «. The P"""' 1 » ; 3 6 ' ^ 3 Tand . 



CC 
CC 

cc 

CC of" the present invention 

XX 
SQ 



Sequence 2002 BP; 594 A; 418 C; 463 G; 527 T; 0 other; 
Query Match 97.9%; Score 992 . 8; HE 22 ; Length 2002; 

Best Local Similarity 99^8%; ^^^'^ Indels . 0 ; Gaps 0; 
Matches 994; Conservative 0, Mismatches 



Qy 

Db 



•9 TTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATCCTG^^ 78 

79 ATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAAGAA^ 138 

fil ATTTTGGAAAAGCAAGAC 120 
,„ rAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAACCCCCAACAGAAGCAGTG 198 
Oy ,39 ^CAATGAAAGAAA. . | 1 | | | | j | | | j | | | | | | | | | | | | | | | | | | | | | I I I I I I I I 



Db 



, 93 C.CTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTC «fl - 

I 1 m 1 M M I i I I M M I I I 1 I M I M 11 1 M I M N II I M I I M M 

r AGCTGATAG^CTTTGAGGGAAAAAAAGATGTGACCCAGATATTTAACAACATC 318 ' 

?mt m 1 1 1 1 1 1 1 1 1 M III I M II II I M 1 1 1 M IN 1 1 1 I II M 1 1! i 1 1 1 1 

r ? r, CAGCTGATAGACT ^ 1 

*** i.GACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGTC 373 

II m i I 1 I I I I I I I I J H I M I I I M I I I I I I M I I I I II M I 
30r AGACAGATAGGCACT 360 

379 TTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTT^ 

! I I I 1 I i I I I I I I I I I I I I I I I II I I M I I I I I I I 1 

1 1 11 I I I I I I I II I I M I I I I III I I II II I I I I I 1 I 1 I I I M I 

AGAGAATGTATTCGACATG 480 

Q ,... ^ GATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATAT "8: 

J I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I ' "" 1 1 1 1 1 

,81 GATTTCTTTAAGTACGTGGAGT^ ^ 

-0 TTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGACTTCTTAGAACAAAATTAC 618 
cy 559 TTCAAGGATTTAC . . , , , , , , , , , ( | , | | , | | | | | | | | | | | | | | | | I I I I I I I I I 

, A1 TTCAAGGATTTACTAACCAGAC^ 6 °°= 

„ e GACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAGAATTATGTTACTAAGAGA 678 
Qy ,1. GACACTAi ill , . 1 1 1 1 1 - , , , 1 1 1 J 1 1 1 1 1 I I J I I I I I I I I I I I I I 

601 GA cACTAtU T ^ 660 



Dfcr 

Db 



rw 67 9 CAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCACAACTTTGCCATCATGACA 73 8 

' ' i 1 1 1 1 1 1 1 1 1 1 1 1 1 II i 1 1 1 1 1 1 1 1 1 1 II 1 1 M M [I I IMM M iMMI J I ' 

Db 661 



CAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCACAACTTTGCCATCATGACA 72 0 



7 3 9 AAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAACCTCCTTCGGGATAAAAGT 7 98 

Qy | M I M t M 1 1 1 1 M 1 1 1 1 1 M 1 1 1 ! 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M I 

Db 72 i AAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAACCTCCTTCGGGATAAAAGT .7 8 0 

nw 799 CCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTTGTGGCCAGTCCTCACAAA 858. 

Qy 1 1 1 M 1 1 M | M 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 

Db 70 i cccAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTTGTGGCCAGTCCTGACAAA 340 

Ov SB 9 ACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAACTCATTGAGTTTCTGAGC 918 

Y " IIMIMMIMMIMMMMMMMMMMMMMMMIMM IMM 

Db S4i ACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAACTCATTGAGTTTCTGAGC 900 

nv 919 AGCTTCCAAAAAGAAAGGACGGATGATGAGCAGTTCGCTGACGAGAAGAACTACTTGATT 978 

Qy 1 1 1 1 1 1 II 1 1 M 1 1 1 Ml 1 1 II I II 1 1 M 1 1 Ml I M 1 1 III 1 1 1 ! 1 1 1 M I II II 1 1 II 

ub 9 oi AGCTTGCAAAAAGAAAGGACGGATGATGAGCAGTTCGCTGACGAGAAGAACTACTTGATT 960 

Ov 57 9 AAACAGATCCGAGACTTGAAGAAAACGGCCCCTTGA 1014 

MMM M Ml IMM IMM IMM II I III MM 

Db ?S1 AAACAGATCCGAGACTTGAAGAAAAGGGCCCCTTGA 996 





T ,. r i 








.4 71 






10 


^1054 71 standard; cDNA ; 322 


BP. 




<h*y}10547i; 






A A 








DT 


■;6-JUN-2 001. 


(first entry) 




XX 








DS 


Human cDNA clone (5 1 -primer) 


SEQ 


XX 








KW 


Human; primer 


; detection; diagnos 


XX 








OS 


Hovuo sapiens. 






XX 








PN 


BP1074617-A2 . 






XX 








PI) 


07- FEB-2001 . 






XX 








PF 


23-JUL-2000; 


2000EP-0116126 . 




XX 








X?R 


^9-JUL-1999; 


99JP- 0248036 . 




PR 


27-AUG-1999; 


99JP-0300253 . 




PR 


ll-JAN-2000; 


2000JP-0118776 . 




PI? 


02-MAY-2000; 


2000JP-0183767 . 




pp 


09-JUN-2 0 00; 


2000JP-0241899 . 




XX 








PA 


;he.li-} helix res inst. 




XX 








PI 


Ota T, Isogai T\ Nishikawa 


T, 


PI 


Ishi.i S, Sugiyama T, Wakamatsu 


XX 









DR WPI; 2001-318749/34. 
XX 

PT Primer sets for synthesizing polynucleotides, particularly the 5602 

PT full-length cDNAs defined in the specification, and for the detection 

PT and/or diagnosis of the abnormality of the proteins encoded by the 

PT full-length cDNAs - 
XX 

PS Claim 1; SEQ ID 2306; 2537pp + CD ROM; English. 
XX 

CC The present invention describes primer sets for synthesising 5602 

CC full-length cDNAs defined in the specification. Where a primer set 

CC comprises: (a) an oligo-dT primer and an oligonucleotide complementary 

CC to the complementary strand of a polynucleotide which comprises one of 

CC the 5602 nucleotide sequences defined in the specification, where the 

CC oligonucleotide comprises at least 15 nucleotides; or (b) a combination 

CC of an oligonucleotide comprising a sequence complementary to the 

CC complementary strand of a polynucleotide which comprises a 5 ' -end 

CC sequence and an oligonucleotide comprising a sequence complementary to a 

CC polynucleotide which comprises a 3 ' -end sequence, where the 

CC oligonucleotide comprises at least 15 nucleotides and the combination of 

CC the 5 '-end sequence/3 ■ -end sequence is selected from those defined in 

CC the specification. The primer sets can be used in antisense therapy and 

CC in gene therapy. The primers are useful for synthesising polynucleotides, 

CC particularly full-length cDNAs . The primers are also useful for the 

CC detection and/or diagnosis of the abnormality of the proteins encoded by 

CC the full-length cDNAs . The primers allow obtaining of the full-length 

"X; cDNAs easily without any specialised methods. AAH0 3 166 to AAH13628 and 

r 'C AAH13633 to AAH18742 represent human cDNA sequences; AAB92446 to' 

CC :\AE95893 represent human amino acid" sequences; and AAH13629 to AAH13632 

CC represe:?.c oligonucleotides, all of which are used in the exemplification 

CC or. the present invention. ■ 

KX - . ' ~ v 

SO - C^quence 322 BP; 268 A; 164 C; 171 G; 216 T; 3 other; ■■ a 

Query Match 76.0%; Score 770.6; DB 22; Length 822; 

Best Local Similarity 98.5%; Pred. No. 2.1e-204; 

Matches 798; Conservative 0; Mismatches 10; Indels 2; Gaps . . 2 



O" 19 TTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATCCTGAAAGACAATTTGGCC 7 8 

I I I I I I I I I I I I I' I I I I I I I I I I I ! I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I ! I I 
Phr 1 TTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATCCTGAAAGACAATTTGGCC 6 0 

Qv 7 9 ATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAAGAAGTGTCTAAATCACTG 13 8 

! M. II I i 1 1 1 M ! 1 1 1 1 II 1 1 1 Ml 1 1 1 1 1 1. 1 1 1 1 1 II i II I !l 1 1 ! I i I II : 1 1 , 1 i 

Db r Gl ATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAAGAAGTGTCTAAATCACTG 12 0 

Qy 13 9 CAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAACCCCCAACAGAAGCAGTG 198 

1 1 1 M 1 1 1 M 1 1 1 1 1 1 !l I h 1 1 1 1 il 1 1 1 II M I M I II I II I II III I M II 1 1 1 1 1 1 

Db 121 CAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAACCCCCAACAGAAGCAGTG 180 

Qy 19? GCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTGACACTGATAGCTGACCTG 25 8 

illl MM MM MM I III MINIMI INI II III 1 1 ! i 1 1 1 1 1 1 1 1 M I i 1 1 i 1 1 

Db 101 GCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTGGTGACACTGATAGCTGACCTG 24 0 



Qy 

Db 



2 59 CAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATATTTAACAACATCTTGAGA 318 

1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M i I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M i 1 1 1 1 1 1 1 1 

241 CAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATATTTAACAACATCTTGAGA 3 00 



Qy 

Db 



o x q AGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGTGCTCATCCTCATATCCTG 3 7 8 

301 AGACAGATAGGCACTCGGAGTCCT 360 
„ 9 tttatGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTACGTTGTGGGATTATGCTG 438 

Qy 3.9 ff^™^r] |M ,,,111,1111,1,11111111111111111.1 1| MM || || Mil 

Db 361 i^TCCK^ 420 

Qv 439 AGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTCTTTTCTAA^ 498 

Y I 1 1 1 1 1 m I II 1 1 1 M 1 1 1 1 I 1 1 1 1 1 1 1 Mill II I II llll Him I 

.;>o PRTTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCTTCAGATGCCTTTGCTACT 55 8 

Qy . ^ ?tTTT?TTTtt?Itu 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m m in ii i ii i ii mi 

Db 481 G IiUcUiAA^ 

C c Q TTPAArraTTTACTAACCAGAGATAAAGTGTTGGTAGCAGACTTCTTAGAACAAAATTAC 618 
Qy 55 9 TTCAAGGATTlAv-TAACC | | ( | { ^ , ( ( 

541 TTCAAGGATTTACTAACCAGACA 600; 
619 GACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAGAATTATGTTACTAAGAGA 6 78 

m mil 1 1 Him in ii mm mi n mi 1 1 n m mum mum , 

SOI GACACTATTTTTGAAGACTATG 66:0 
: ,. , CAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCACAACT 738: 

! I II M 1 1 II II 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II : ! 1 1 1 1 1 1 1 1 1 1| 1 1 1 1 1 1 II 1 1 

cagtctttaaagctgct^ 720 

. 7,,, AAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAACCTCCTTC^ 798. 

. j 1 1 ii 1 1! 1 1 M 1 1 1 1 1 Ml 1 1 1 1 1 1 ! I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m 

jJvGTATATCAGCAAGCCGGAGAACC^ 778 ' 
7 oe -CCAACATCCAGTTTGAAGCCTTTCATGTT 82 8 

11 Ml II I II MINI!- M I Ml 

77 9 CCCAACATCCAGTTTGAGCCTTCTGGTTTT 808 



QY 



Cv 



Db 



AAX3S817 

IB AAX39817 standard; DNA; 831 BP. 

xx 

AC AAX3S817; 
XX 

DT 02 -JUL- 19 99 (first entry) 
XX 

BE. Gastric cancer associated gene. 
XX 
KW 

kv; 

KW prostate cancer; ss, 
XX 

OS Ho: no sapiens. 
XX 

FN WO 9 904 2 65 -A2 . 
XX ■ 

PD 20 JAN-1999. 



Cancer associated antigen; diagnosis; research; treatment; human; 
breast cancer; colon cancer; gastric cancer; renal cancer; lung. cancer, 



XX 

PF 15-JUL-1998; 98WO-US1467 9 . 



XX 

PR 22-JUN-199B 

PR 17- JUL- 19 97 

PR 10-CCT-1997 

PR 10 -OCT- 1997 

PR 10-OCT-1997 

PR ll-OCT-1997 



.«■-. • I 



CC 



98US-0102322 . 
97US-0896164 . 
97US-0061599 . 
97US-0061765 , 
97US-0948705 
97GB-0021697 



New isolated cancer associated nucleic acids and polypeptides - 
isolated using sera from cancer patients, used to develop products 
for the diagnosis, monitoring or treatment of cancers 



XX 

PA (LUDW-) LUDWIG INST CANCER RES. 

XX 

oi - hen y, Gout I, Cure A, O'Hare M, Obata Y , Old LJ; 
PI Pfreundschuh M, Sahin U, Scanlan MJ, Stockert 53; 
pt Tureci 0; 

XX 

DR WPI; 1999-132448/11. 
XX 
PT 
PT 
?T 
XX 

PS Claim 67; Page 558-559; 787pp; English. 

P -rn, invention relates to a method for diagnosing a disorder ^araoterised - 
^expression of a human cancer associated antigen precur coded ,f o, by 

n,cleio- acid molecule (MAM) . The method comprises: (a) contacting * 
^ ■ .d ] sample isolated from a .subject with an agent that specifically. 

an expression product or a fragment of an expression * 
-oduct -omplexed with an HLA molecule; and (b) determining the 
i'--eractIon between the agent and the NAM or the expression product as a 

termination of the disorder. The products and.methods can be used in 

U e diagnosis monitoring, research, cr treatment of conditions . 
^aracter^edby the expression of various cancer associated antigens. 
- n - provides nucleic acid sequences and encoded polypeptides 

CC wnich are cancer associated antigen precursors "P""^"^^ 
CC cancer, renal cancer, colon cancer, gastric cancer, prostate cancer and. 

CC lung cancer. . 
s£ Sequence 831 BP; 285 A; 165 C; 167 G; 209 T; 5 other; 

Query Match 57.5%; Score 684.6; DB 20; Length 331; 

Best Loral Similarity 96.1%; Pred. No. 1.9e-180; 

Matches 764; Conservative 0; Mismatches 23; Indels 8; Gaps.. 6, 



Qy 


1 


ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTG 

l !i I II HI! 1 1 III III Mill III Ml UN II IIIIM Mill llllll 
ItcIaaaaaItgcctttg™^ 


60 


Db 


37 


96 


Qy 


61 


CTGAAAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTC 

1 1 1 1 ii ii 1 1 1 II 1 1 II 1 1 II II 1 1 ! 1 II II M II II 1 II 1 II II 1 II II II 

CTGAAAGACAATTTGG^ 


120 


Db 


97 


156 


Qy 


121 


r AACTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA 

^?TiMititniinniniinnnnninninninni!inniii 


180 


Db 


157 


216 



181 CCCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACT^ «<> 
2 ^ ACACTGATAGCTGACCTGCAGCTGATAGAC^ 300. 

33V U^AAC^^ 396 
GCTCATCCTCATATGCTGTTTATGCTCCTCAAAGGATATGAAGCCCCA^ 420 

MM I I I II I I I II I i I 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M II 



Qy 
Db 

Qy 

Db 



Db 

Qy 

Db 



4 -i ^GTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCC^ 480 

Qy Tm 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M I M 1 1 1 1 1 1 1 1 1 

CGTTGTGGGATTATC ^ 



Qy 

Db 
Ov 



. v. TTTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTT 540- 

Ffm m 1 1 II 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 
.. gl . 7 UU™^^^ 576 

« , -.t" GAGS A.TGC CTTTtS OT AC! TTT C — GG ATTT AC T C AG AOTV'X' AAAG 'rGTT GGrT AG C. — A G 5 9.8 , 
^CTTrTTAGAACAAAATTACGACACTATTTTTGAAGAOTATGAGAAAT^ 65Br9 

; - n/ : ' -' MM I I I I I | | I I I I I I I I I I M I | | 1 1 1 H I 1 1 1 1 1 1 M l 1 1 1 M 1 1 II I I 

. 1; , G ^AXTATGTTACTAAGAGACAGTCTTTAAAG - CTGCTAGGGGAGCTGATCCTGGACCG 716- 
QY " M ill II M Mil III I II I MINI I I I 1 1 1- 1 I I I I.I II I I I I 

717 TCACAACTTTGCCATC - ^^^^^^T^T^T^^^'f ^ " ~ 772 

75 v IcLUcIUg^ 



Cy 

Db 

Qy 

Db 



773 TGATGAACCTCCTTC 7 87 

M 1 1 ! 1 1 1 M I M 

" LV17 GGAGGAACCTCCTTC 831 



RESULT 7 
AAI60020/C 

ID AAI60020 standard; cDNA; 1191 BP. 
XX 

AC AAI6 0 02 0; 

XX 

DT 22-OCT-2001 (first entry) 

XX 

DE Human polynucleotide SEQ ID NO 4009 

XX 
KW 
KW 



nuuian piw-i. / ^>^.^ — 

Human- nootropic; immunosuppressant; cytostatic; gene therapy; cancer; 
peripheral nervous system; neuropathy; central nervous system, OTS, 



KW leukaemia; ss. 
XX 

OS Home sapiens . 
XX 

PN WO200153312-A1. 
XX 

PD 2 6- JUL- 2 001. 
XX 

PF 26-DEC-2000; 2000WO-US34263 . ^ 

PR 21--JAN--2OO0; 2 000US - 04 88725 . 

PR 25-APR-2000; 2 0 0 OUS - 0552 3 17 . 

PR 09-JUL-2000; 2000US-0598042 . 

or 19-JUL-2000; 2000US-0620312 . 

PR 03-AUG-2000; 2000US-0653450 . 

PR 14-SSP-2000; 2000US 0662191. 

PR l^-OCT-2000; 2 00 0US - 06 93 03b . 

PR 23-NOV-2000; 2000US-0727344 . 

XX ' ' ~ 

PA (HV3E-) HYSEQ INC. 

XK ... T , u c AS undi V. Chen R, Ma Y, Qian XB, Ren F, Wang:©; 

S a;:',^: Wehrman T, XuC, Xue^, Vang X, Zhang, a , »: ■ 

V; cm, Zhou P, Goodrich R, Drmanac R.C; -■ 

rj;l KPX; 2CC.L 442253/47. . ■ '-. i . 

c-PSDB; 7NAM40864. 

S nucleic acids and polypeptides, useful for treating disorders, 

p;;< Yuch as central nervous system injuries - 



/"V 
PS 
XX 

<-1 '-I 

CO 



Claim 1; SEQ ID NO 4009; 10078pp; English. 

The invention relates to human nucleic acids (AAI57798-AAI61369) and . 
Tne J: nventiuii A AM42 2 13 ) with nootropic, 

the encoded P^^xd^^38642 MM422 13,^ nucleotides , are useful 

immunosuppressant and cytostatic activi y f i po i ynu cleotide 

O, in gene therapy. A. ^"^^^EelJS S che peripheral nervous 
cc of the i^f ^"^Sla^injSi:., peripheral neuropathy and 
CC system, such as peripheral n ner vous system diseases, such as 

CC localised neuropathies and central J disease, amyotrophic 

e ssir.;;^s rs uses 

,jc lateral sclerosis a * Immune system suppression, 

.utilisation of the J^ivities such actlvity| haemos . tatlc 

Activin/inhibin^activxty c *-^* c J ig and therapy , drug screening 
^"rfceptractivitrarthritL and inflammation, leucaemias and 

£ Note 5 "uence data for this patent did not form part of the printed 
CC specification. 

noi rp. ^48 A- 261 C; 236 G; 346 T ; 0 other; 
SQ: Sequence 1191 BP; 34« a, ^ox ^, 

67.5%; Score 684.4; DB 22; Length 1191; 
Query Matcn ° J ' 1fln , 

Best Local Similarity 99.9%; Pred. No. 2.6e-180, 



CC 

cc 
cc 
cc 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 



Db 

Db 

Qy 

Db 



Matches 685, Conservative 0; Mismacches 1; Indels 0; Gaps 0; 

,29 GCACTCGGAGTCCTACTGTGGAGTATATTAGTGCTCATCCTCAT 38B 

1189 gcacTCGAAGTC^ 1130 

3S , TCAAAGGATATGAAGCCCCACAGATTGCCTTACGTTGTGGGATTA^ 44B 

112STC JJJ^ 

449 TTCGACATGAACCACTTGCCAAAATCATCCTCTTTTCTAATCAA ^08 

, 0,9 UcGAci^ 1010 

,09 ^tacgtggagttgtcaacatttga™ 568 

1009 agtacgtggagttgtcaacatttgatattc 950 

3 * 9 TACTAACCAGACATAAAGTGTTGGTAGCAGA^ ' 
i> 29 TTGAAGACTATGAGAAATTGCTTCAGTCTGAGAATTATGTTACT 

,3, TTGAAGACTATGAGAAATTGCT^ 839 ' 
,,, -"'-GCTGCTAGGGGAGCTGATCCTGGACCGTCACAACT^ 

" 74S GC'AAGCCGGAGAACCTGAAACTCATGATGAACCTCCT 3 ° 8 - 

759 GC ]^ 710 

,09 AGTTTGAAGCCTTTCATGTTTTTAAGGTGTTTC 368 

709 AGTTTGAAGCCTTTCATOT 650 

8S9 TTGTGGAGATCCTGTTAAAAAATCAGCCCAAACTCATTGAGTTTCT 928. 

649 TTGTGGAGATCCTC 590 

,29 AAGAAAGGACGGATGATGAGCAGTTCGCTGACGAGAAGAACTA^ '** 

5,9 1JJ; AAAGGACGGATGA^ 530 

08 9 GAGACTTGAAGAAAACGGCCCCTTGA 1014 

| II I I | | M I II I I I I I M I I I I M I 

529 GAGACTTGAAGAAAACGGCCCCTTGA 504 



RESULT £ .. i 

AAC917 7'-' 

Id' AAC91772 standard; cDNA; 1026 BP. 

XX. 

AC AAC91772; 



27-MAR-2001 (first entry) 

Human ANIC-BP (acute neuronal induced calcium-binding protein) cDNA 
Human- acute neuronal induced calcium-binding protein; ANIC-BP; 

cerebroprotective; neuroprotective; ss. 



DT 
XX 
DE 
XX 
KW 
KW 
KW 
KW 
XX 

OS Homo sapiens. 
XX 

PN WO200078947-A1- 

XX 

PD 28-DBC-2000. 

PF 14-xTON-2000; 2000WO-EP05457 . 

PR 2-2-JUN-1999; 99EP- 0112 024 . 
XX 

PA (MERE ) MERCK PATENT GMBH . 

XX 
PI 

XX 

DR V/PX; 2001-102721/ 11 

;>:< p-PSIAB; AAB48970. 



De„ Daas I, Fischer V, seyfried C, vol, Melchner L; 



treating acute head trauma, btroKe, mux.-j-p. 



XX 
XX 



ir j ury 

Olaim 5; Page 35-36; 50pp; English. 



CC 



-ion relates to human acute neuronal induced calcium-binding • 
•me invention relates to nui Pn codinq it. The invention , - 

pxotein (ANIC-BP) and to ^^^^^^/^^inant host cells comprising 
also relates to expression systems ^rjcombinan ies specific 

NTIC-BP DNA, the recombinant production of MIC BP an QbuUn 

for ANIC-BP, fu -i«-P^^ r 2S S Modulators of ANIC-BP function. 
Fc region, and methods of screening L _ d Mo25 proteins. 

C,- . ANIC-BP has homology and structural "^Jf .^^e^ing stroke- and 
CC ANIC-BP proteins and -^f^^^; f p ^1 S injury. ANIC-BP 
CC acute head ^auma multiple sclerosi, p ^ ing membrane bound 

CC proteins are useful m s «eening vac ^ ines ANIC-BP nucleotides are 
CC or soluble receptors, and also m vaccine^ tissue expression studies,. 



CC 
CC 

cc 

CC cON^ encoding human ANIC-BP. 
XX 
SQ 



in9 , RP . 359 A . 199 C; 203 G; 265 T; 0 Other; 
Sequence 1026 BP, ^> - L ^- / ' 

■ ■ . S7.5%; Score 582.6; DB 22; Length 1026; 

Ouerv Match ^ j " c i • 

Best' Local Similarity 74.7%; Pred N^^e -152, . 
Matches 748; Conservative 0; Mismatches 244, 

, 8 GTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATCCTGAAAGACAATTTO 77 



Db 12 



78 CATTTTGGAAAAGCAAGAG AAAAAGACAGACAAGGCTTCAGAAGAAGTGTC 128 

il li II 1 1 II I II III Ml 1 1 1 1 MUM M MM MM M 

72 tgUcTGGAAAAGCAAGACATTTCTGATAAAAAAGCAGAAAAGGCTACAGAAGAAGTTTC 131 
1 ? 3 T^AATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAACCCCCAAC 188 

mi mi 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II I M Ml 1 1 1 1 I M 

13.2 CAAAAATCTGGTTGCCATGAAAG 191 
.89 AGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTGACACTGAT 248 

' 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 I M II 1 1 1 1 1 II MINI M I II I 

132 AGAAGCAGTAGCTCAAG^ 251 
,,o z^GCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATATTTAACAA. 308 

1 1 1 1 1 1 I Mill II 1 1 1 1 II 1 1 1 ! I II 1 1 M 1 1 III I II M II 1 1 1 1,1 

252 AGCTGATTTACAGCTCATTC 311 
309 CATC-TGAGAAGACAGATAGGCACTCGGAGtCCTACTGTGGAGTATATTAGTGCTCATCC 368 

II "i Mi Mill III! II I I Mill II 1 1 INI II I Mil 

TATTCTCAGAAGACAAATTGGTA^ 371 / 

369 TCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTACGTTC 42.8 

1 1 1 1 MM Ml I I Mill I II II I I III I M M II I M II I 

372 GAATATTTTGTTCATGTTATTG «1 ' 

A29 GATTATGCTGAGAGAATGTATTCGACATGAACCA 488' ' 

li Mil III II I M- I M I M II II 1 1 1 1 1 II Ml II I II M il l 

, 2 AATAATGTTAftGAGAATGCATCA^ 4 91 

■/ »0 TCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCTTSAGATGC 548- .. 
W -CAATTCAG \\ \\ \\ I II I II M II Ml I MM II I I Ml I I 

.ft?.? jscACtTTTTATGATTTCTTCAGATATGTCGAAJvT'G 5 51, . 

ctttgctactttcaaggatttactaaccagacataaagtgttggtagcagacttct™ ccs, 

. 552 atTTGCCAC^ 611 
,09 ACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAGAATT 668 

Ml MM Ml MM MM Mill i II Mill MM MM Ml 

612 icAGCATTATGATAGATTTTTCAGTGAATATGAGAAGTTACTTCATTCAGAAAATTATGT 671 
TACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCACAACTTTGC 728 

• ' " ' li 1 1 m II I M I I Mill II III M I M M I I M I M M I 

. 6.72. GACAAAAAGACAGTCACTC 731 
7,9 CATCATGACAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAA 788 

il 1 1 1 1 1 1 1 1 II Mill M II II I II 1 1 II II 1 1 M II 

732 AATTATGACAAAATACATCAGTAAACCTGAG ™ 
789 GGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTT^ 848 

M Mill II II MUM III MM MM I II II II II M I II II MM 

'mX AGACAAAAGTCGCAACATCCAGTTTGA^ 851 ' 
849 TCCTCACAAAACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAACTCATTGA 908 

i I I I I I I I M Mill II I II MM I II Ml I II I Ml I II M 

352 TCCTAAC/AGACGCAGCCCATCC 911 



Cv 963 CTACTTGATTAAACAGATCCGAGACTTGAAGAAAACGGCCC 1009 

Y n ill II I II 1 1 1 M I M 1 1 1 1 M I M I 

Db Q72 CTATTTAGTOAAACAGATCAGGGATTTGAAGAGACCAGCTC 1012 



RESULT 9 

m K13 ABK13127 standard; cDNA; 3281 BP. 

XX 

AC ABK13I27; 
XX 
DT 

XX ^n^^lpnfide- (BDtm) CDNA (481257.3) 

DB 
XX 
KVJ 
KW 
KW 

KW 
KW 
IIW 
KW 
KW 



09-APR-2002 (first entry) 
Human secretory polynucleotide (sptm) 

■ -. ^ f i p , t-ra ismembran^ domain; human; sptm; ss; gene; 
Signal peptide; transmemoran atherosclerotic ; antipsoriatxc ; 

481257.3; ant iartenosclerotic ; ^ nt ^ he ^!^^ lerqi ; . antidiabetic;.- . 
a: ,tii„£ la ™atory, cytostatic, .nt HIV . .™ro . 
neurotropic; antxgout antxtnyro , p dermatological-; cancer; • 
osteopathic; "tirh^tjc; »^ vulnerary; gout; 

ir.,unosuppressxve; «^«; t ^. nootro * ic; antiparkinsonian; . 
-ticcuvuisant; ^^ P ^f pro i if er ative disorder; arteriosclerosxs ; 
:S^s" l™ system disorder; inflammation; 

, 1! ILiency syndrome; AIDS ; Addison's disease; , , • 
acquired immunodeti^iency .. all ~ rrn ,. cirrhosis; osteoporosis ; 

adult respiratory distress syndrome; allergy, ' OBte0 ^ tteitl s. 

diabetes mellitus; ^^J^^^^^ ulcerative, colitis ; 

f hetim f ° ld ;!f c ^i.^^icS disorder; stroke; epilepsy;, 
haematopox^xc^cancer f ^ disease; meningitis; prion dxsease; 

* u !^ cerebral palsy; myasthaenia gra.xs-:; .. 

diabetic neuropathy; Alzheimer's dxsease. 



KV,; kuru ; 



is. vi 

XX 

CO Homo sapiens 
XX 
PN 
XX 

PD 15 -FEB -2 001. 
XX 

PR 05- AUG 1999; 99US- 147500P . 
PR 05- AUG-1999; 99US-147501P . 
XX 

PA (INCY-) INCYTE GENOMICS INC. 



XX 
DR 
XX 



WO200111032-A1. 



oi-JCN.-20-OO-; 2000WO-US15246 . 



XX • 1in nt? r USS o FD Spiro PA, Banville SC; 

PI «,» », Rosen BH, Chal»p«s, 

11 » " ». — « »• Poseberry AM; 

PI Wright RJ, Daniels SE; 



WPI; 2002-147236/19 . 



PT Nov°l secretory polynucleotide (sptm) and polypeptides encoded by sptm, 

PT userul for diagnosing and treating disorders or diseases associated 

H with cell signaling e.g., allergy, psoriasis, Grave's disease, epxlepay 

PT 
XX 

pg Claim 1; Page 192-193; 198pp; English. 



XX 

cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 



This invention relates to novel cDNA molecules encoding "dated 
secretory polynucleotides (sptm) with similarity to signal peptide 
(SP) or transmembrane domain (TM) consensus sequences^ The 
polynucleotide sequences of the invention are useful for producing 
spi-m protein by recombinant techniques, the protein may be used to 
qenerate anti-sptm antibodies which may be used to analyse protein 
expression levels in different tissues. The sptm molecules are useful 
^diagnostic and therapeutic purposes e.g., to diagnose or treat a 
*"ondition associated with cell signaling such as a cell proliferative 
isomers (e.g., arteriosclerosis, atherosclerosis, psoriasis cancers) , 
CC Immune system disorders (e.g.., inflammation, acquired immunodeficiency 
CC syndrome ^ AIDS) , Addison's disease, adult respiratory distress syndrome, 
allergies, cirrhosis, diabetes mellitus, gout, Graves' disease 
multiple sclerosis, osteoarthritis, osteoporosis, rheumatoid arthritis, 
systemic lupus erythematosus, ulcerative colitis and haematopoietic 

!f C er a neurological disorder (e.g., stroke, epilepsy Huntington' s 
-"ease' Parkinson's disease, meningitis, prion diseases including kuru, 
'--utzfeldt Jakob disease, cerebral palsy, myasthenia gravis, diabetic 
V.^uropathy and Alzheimer's disease) . Sptm sequences can be used to ,., 
-'-tect the presence of or quantifying the amount of sptm-related • 
Polynucleotide in a sample. The sptm polynucleotide is used ^design 
;-obes useful in diagnostic assays carried out ^detect or confirm _ 
conditions, disorders, or diseases associated with abnormal levels of . 
Sm expression. Sptm, its fragments or oligonucleotides derived from 
:ntm may be used as primers in amplification steps prior to ^ _ 
hybridisation. The present sequence represents c.he human sptm (4812,57. i) 
CC cplS'A sequence of the invention. 

% Sequence 3281 BP; 1014 A; 601 C; 676. G; 990 T; 0 other; 

n--ry Mcttch 57.5%; Score 582.6; DB 24; Length 3281; 

Be~t Local Similarity 74.7%; Pred. No. 9.3e-152; _ 9 - Gaos '■ 1- 

.Matches 748; Conservative 0; Mismatches 244; Indels 9, Gaps 

O,, 1« GTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATCCTGAAAGACAATTTGGC 77 

' 1 1 1 1 1 M 1 1 1 i 1 1 1 1 I • 1 1 1 1 1 M I 1 1 1 1 1 1 1 1 I 1 1 1 1 1 H I 1111 

101 gtUgggUgtctcac^ 160 

ov 78 CATTTTGGAAAAGCAAGAC AAAAAGACAGACAAC^CTTCAGAAGAAGTGTC 128 

1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1. 1 1 1 1 1 1 II 

161 TGTTCTGGAAAAGCAAGA^ 220 
,29 TAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAACCCCCAAC 188 

Ml ill I i I M 1 1 II II 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 M I 1 1 I H c , 

221 cAAAAATCTCGTTGCCATGa'^GAAATTCTGTATGGCACAAATGAAAAAGAGCCTCAGAC 2 80 
Qy 189 AGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTGACACTGAT 248 

1 1 1 1 1 1 1 1 1 I M 1 1 1 1 1 1 1 1 1 II I II I 1 1 M ill! 1 1 1 1 1 I 

Db 281 AGAAGCAGTAGCTCAACTTGCTCAAGAA 340 



CC 

cc 
cc 
cc 

C : ' '; 
CC 



cc 
cc 
cc 



Db 

Qy 

Db 



,49 AGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGM 308 

I I I I I I I Mill II I I I I I II II I I 'I I 1 11 1 1 1 11 

341 AGCTGATTTACAGCTCATTGACT 4 °° 

309 CATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTACT 368 

XATTCTCAGAAGACAAATTGGTACGAG «« 
369 TCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTACGTTGTGG 428 

5 ,-L ^ Itaatgttaagagaatgcatcaga^ 5 ^ 

" 581 ACAGTTTTATGATTTCTT^ 640 
,49 CTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGACTTCTTAGA 608 

iii 1 1 II 1 1 1 1 1 1 1 1 | 1 1 1 1 1 II II II l! II II 

r'^TTTGCCACATTCAAGGAT^ 7 °°\ 

609 "'-AAAATTACGACACTATTTTTGAAGAC 

, mm I | I I 1 I I I III II ! i -1.1 i I I I I I 

. 7 ot jcIgcattatgatagatttttcagtg 76 °- 

K- v: > 'JACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATC ™' 

■ : - i i II I I M l I I I 1 I M l I I I I I I I I I M I ■ I 

n ■ ,,,, G iiJli G icAGTCA 

l. - ^ CATCATGACAAAGT^^ 788 ^ 

*' J II MM II 1 I II I I I I I I i i 1 1 I I M II M I II ll 

321 AATTATGACAAAATACATCACT 880 
CY 789 GGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTT B48 , 

i^iUiJLkU^ 1000 

9 V. 0TTTCTGAGCAGCTTCCAAAAAGAAAGGACGGATGAT3AGCAGTTCGC 96B - 

Vii ii 1 1 i i ii ll II II 1 1 1 1 I 1 1 1 1 M 1 1 1 

,: 001 UUU^ 1060 

r 969 CTACTTG ATTAAACAGATC^^ 1009 

-, 06 i CTATTTAGTTAi^CAGATCAGGGATTTGAAGAGACCAGCTC 1101 



Qy 

Dt 

Qy 

D.b 

Qy 

Db 

Qy 

Db 



Db 

Qy 

Db 



RESULT 10 

ABV22S87 

ID ABV22 937 standard; cDNA; 3 84 9 BP. 

XX 



AC ABV22987; 
XX 

DT 13-SEP-2002 (first entry) 
XX 

DE Human prostate expression marker cDNA 22978 



XX 
KW 



Human; prostate cancer; cytostatic; carcinogen; pharmacodyanamic marker; 

KW pharmacogenomic marker; gene; ss. 
XX 

OS Homo sapiens . 
XX 

PN WO200160860-A2 . 
XX 

PD 2 3-AUG-2001. 

XX 

PF 20-FEB-2001; 2001WO-US05171 . 

xx - ■ * 

PR 17-FEB-2000; 2 0 0 OUS - 183 3 19P . 

PR 16-MAR-2000; 2000US-189862P . 

PR 25-MAY-2000; 2000US-207454P . 

?R 09-JUN-2000; ■ 2000US-211314P. ■ 

PR 18-JUL-2000; 2000US-219007P . 

?R - 13-DEC-2000; 2000US-2552 81P . 

XX ' 

PA J MILL-) MILLENNIUM PREDICTIVE MEDICINE INC. 

XX 

•?r ::clilegel R ■ Endege WO, Monahan JE; 

I.... mi; 2001-562795/764 - -' ?: ' ' ' 

J'. 1 Novel isolated nucleic acid molecule associated with cancerous- state of 

pr- prostate ceils and correlating with presence of prostate cancer, useful 

;.>':? ^'pr detecting presence of prostate cancer, stage of prostate cancer - . 

V.3 Claim 1; Page 4088; 11750pp; English. 

XX 

CC The invention relates to an isolated nucleic acid molecule (I) comprisxng 

CC a nucleotide sequence given in Tables 1-9 (ABV00010-ABV62213 ) of the 

CC specification or its complement. (I) is useful for: 

CC (a) assessing whether a patient is afflicted with prostate cancer; 

CC (b) monitoring the progression of prostate cancer in a patient; 

CC (-j) assessing the efficacy of a test compound to inhibit prostate 

CC cancer in a 'patient; 

CC (d) assessing the efficacy of a therapy for inhibiting prostate cancer 

CC . in ci patient; ■ 
CC (e) selecting a composition for inhibiting prostate cancer in a patient; 
CC (f). assessing the prostate cell carcinogenic potential of a compound; 
CC (g) determining whether prostate cancer has metastasized in a patient ; 
CC (h) assessing the aggressiveness or indolence of prostate cancer in a 

CC patient; 

CC (I) is also useful as a pharmacodyanamic or pharmacogenomic marker. 

XX 

30 Sequence 3849 BP; 1142 A; 745 C; 358 G; 1081 T; 23 other; 

Querv Match 57.5%; Score 582.6; DB 23; Length 3849; 

Best" Local Similarity 74.7%; Pred. No. le-151; 

Matohes 74 8; Conservative 0; Mismatches 244; Indels 9; Caps 1 



437 



GTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATCCTGAAAGACAATTTGGC 7 7 

im II II MINI MINIMI II I M II I I Mill II I MM 

GTTTGGGAAGTCTCACAAATCTCCAGCAGACATTGTGAAGAATCTGAAGGAGAGCATGGC 



496 



-7« nATTTTGGAAAAGCAAGAC AAAAAGACAGACAAGGCTTCAGAAGAAGTGTC 12 8 

1 1 M Mill MM MUM 1 1 1 1 1 1 1 1 1 1 M 

4 97 TGTTCTGGAAAAGCAAGACATTTCTGATAAAAAAGCAGAAAAGGCTACAGAAGAAGTTTC 556 



129 



TAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAACCCCCAAC 

Ml Ml M II I M M II I II I M I Ml Mill II Mill M I M 

CAAAAATCTGGTTGCCATGAAAGAAATTCTGTATGGCACAAATGAAAAAGAGCCTCAGAC 



188 

616 



1 39 AGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTGACACTGAT 24 8 

Ml Mill I Mill II II I II II MM II I Mill M II 1 1 1 1 1 ! 

617 AGAAGCAGTAGCTCAACTTGCTCAAGAACTCTATAATAGTGGGCTCCTTAGCACCCTGGT 67b 



->4 9 AGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATATTTAACAA 

" "MM I Mill Ml Ml II MM I III Mill III I II M II Mill 



3 08 

377 AGCTGATTTAGAGCTCATTGACTTTG ' 736 



3 "9 CATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGTGCTCATCC 368. 

II | llllllll I I II M I I Mill II II II II II I I II I 
7 3 7 TATTCTCAGAAGACAAATTGGTACGAGAACTCCTACTGTTGAATACATCTGCACgCAACA 796- 

x^ATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTACGTTGTGG 128 

" ~Ui| MM Ml I I IMM MUM I. ill I M II. Ml MUM 

; ,. . ; - iAATAXTTTG TTCATGTTATTGAAAGGGTATGAATCTC:CAGAAATAGCTCTAAATTGTGG 856-f. 
'v GATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATGATCCTCTTTTCTAA 4 8fi, - 

II Ml I Illllill M MIMMMMMIM llllllll I I .II I . 

:;«'-7 AATAATGTTAAGAGAATG.CATCAGACATGAACCACTTGCAAAAATCATTTTGTGCiTCGGA 91f>- 
TCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATT0CTTCAGATGC 548 

|| || M II II II Mill IMM I II II 1 1 MM I III! 

917 ACAGTTTTATGATTTCTTCAGATATGTCPAAATGTCAACATTTGACATAGCTTCAGATGC 976 

54 9 CTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGACTTCTTAGA 608 

'| | || | || | M I I i M II II M II I II II II II II I Mill II M M 

977 ATTTGCCACATTCAAGGATTTACTTACAAGACATAAATTGCTCAGTGCAGAATTTTTGGA 1036 

0 9 ACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAGAATTATGT 668 

||| | | | | I | | | | II II llllllll II Mill II II llllllll 

1 0 3 7 ACAGCATTATGATAGATTTTTCAGTGAATATGAGAAGTTACTTCATTCAGAAAATTATGT 10 96 



669 TACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCACAACTTTGC 728 

, II II llllllll I Mill M II II M I II II I I M I 

1097 GACAAAAAGACAGTCACTGAAGCTTCTCGGTGAACTACTACTAGATAGACACAACTTCAC 



1156 



72- CATGATGACAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAACCTCCTTCG 78 8 ■ 

' | | ' | | | | | | 1 | || Mill II II llllllll III I MIMIIMM M II 
1 IS 7 AATTATGACAAAATACATCAGTAAACCTGAGAACCTCAAATTAATGATGAACCTGCTGCG 1216 

789 GGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTTGTGGCCAG 84 8 

|| 1 1 1 1 ; 1 1 1 1 II 1 1 1 1 II II II 1 1 MUM! Mill MM 

1 2 17 AGACAAA AGTCGCA ACATCCAGTTTGAGGCCTTTCACGTTTTTAAGGTGTTTGTAGCCAA 12 76 



Ov 84^ TCCTCACAAAACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAACTCATTGA 908 

7 1 1 1 1 1 1 1 1 { | | HI | || IN Mill I II II III Mill Mill II 

Db 3277 TCCTAACAAGACGCAGCCCATCCTAGACATCCTCCTCAAGAACCAGGCCAAACTCATAGA 1336 
Ov 909 GTTTCTGAGCA.GCTTCCAAAAAGAAAGGACGGATGATGAGCAGTTCGCTGACGAGAAGAA 968 

Qy Ml II MM M II II M Ml DIM II Ml Ml IN I II II I III I 

Db -'-337 GTTCCTCAGCAA< 



GTTTCAGAACGAC AGGACGGAGGATGAGCAGTTTAACGACGAGAAGAC 13 96 



nv 96 q CTACTTGATTAAACAGATCCGAGACTTGAAGAAAACGGCCC 10 0 9 

Y |H M I Mill I MM I II MUM I I Ml I 

Db 13 97 CTATTTAGTTAAACAGATCAGGGATTTGAAGAGACCAGCTC 14 3 7 

RESULT 11 

ABV2 8 8 22 5 
ID ABV2 8 822 -standard; cDNA; 3,849 BP. 

XX 

AC ' ABV2 8822; 

XX 

DT 16-SEP-2002 (first entry) - *f 

XX - • "TT 

DE Human prostate expression marker cDNA 28813. 

% Human; prostate cancer; cytostatic; carcinogen; pharmacodynamic marker; || 

pbarmacogenomic marker; gene; ss . - ■* 

xx •" •"' ... 

OS Homo sapiens. 

XX i 

_?N WO 2 0016 0 860 -A2 - 



PA (MILL- ) MILLENNIUM PREDICTIVE MEDICINE INC. 

XX 

?I Schlegel R, Endege WO, Monahan JE; 

DR WPX; 2001-662795/76. 



Novel isolated nucleic acid molecule associated with cancerous, state of 
prostate cells and correlating with presence of prostate cancer, useful 
for detecting presence of prostate cancer, stage of prostate cancer - 

Claim. 1; Page 6066-6067; 11750pp; English. 

The invention relates to an isolated nucleic acid molecule (I) comprising 
a nucleotide sequence given in Tables 1-9 (ABV00010-ABV62213) of the 
C specification or its complement. (I) is useful for: 
CC (a) assessing whether a patient is afflicted with prostate cancer; 



PT 
PT 

PT 
XX 
PS 
XX 
CC 
CC 



XX '■ - .... 
; '■ ■ 23-AQG-.20.0r. 

S0--FK3-2001; 2 0 0 1W0 •- vJS Co 1 71 . 

17-FEB-2000; 2000US- 1833 19P . 

vyp ' 16-MAR-2000; 2 0 00US - 18 9862P . 

PR 2-5 -MAY- 2 00 0; 2000US-207454P . 

PR 09-JUN 2000; 2 0 0 0US - 2 113 14P . 

PR 13-JUL-2000; 2000US-2 19007P . 

PR 13-DEC-20C0; 2 0 0 0US - 2 552 8 IP - 
XX 



CC (b) monitoring the progression of prostate cancer in a patient ; 

CC (c) assessing the efficacy of a test compound to inhibit prostate 

CC cancer in a patient; 

CC (d) assessing the efficacy of a therapy for inhibiting prostate cancer 

CC in a patient; . ,..:„„<- 

CC te) selecting a composition for inhibiting prostate cancer in a patient; 

CC (f) assessing the prostate cell carcinogenic potential of a compound; 

CC (g) determining whether prostate cancer has metastasized m a patient; 

CC (h) assessing the aggressiveness or indolence of prostate cancer m a 

CC patient; 

CC (I) is also useful as a pharmacodyanamic or pharmacogenomic marker. 
XX 

SQ Sequence 3849 BP; 1142 A; 745 C; 858 G; 1081 T; 23 other; 

Query Match 57.5%; Score 582.6; DB 23; Length 3849; 

Best" Local Similarity 74.7%; Pred. No. le-151; 

Matches 748; Conservative 0; Mismatches 244; Indels 9; Gaps i; 

••B GTTTAGTAAATCAC^CAAAAATCCAGCAGAAATTGTGAAAATCCTGAAAGACAATTTGGC 7 7 

^ INI I || II | 111 ||. IIIIMIM MINIM I HIM M I Mil 

4 37 GTTTGGGAAGTCTCACAAATCTCCAGCAGACATTGTGAAGAATCTGAAGGAGAGCATGGC 496 
Ov 78 CATTTTGGAAAAGCAAGAC AAAAAGACAGACAAGGCTTCAGAAGAAGTGTC 128 

M I Ml I II Mill II ■ Mill III! MUM I III II MM II 

• 97 TGTTCTGGAAAAGCAAGACATTT'CTGATAAAAAAGCAGAAAAGGCTACAGAAGAAGTTTC 556 

-•9 TAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAAGCCCCAAC 188: 

Ml Ml M I I M I Mill II III I Ml Mill M M II I II I II ■ 
■ .77 CAAAAATCTGGTTGCCATGAAAGAAA.TTCTGTATGGCACAAATGAAAAAGAGCCTCAGAC 6 16. 

• 9 ^GAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTGACACTGAT 248": 

' Ml ill I II Mill II II Ml! I MM II I Mill M II MUM 

:V7 AGAAGCAGTAGCTCAACTTGCTCAAGAACTCTA.TAATAGTGGGCTCCTTAGCACCCTGGT ' 6 IS. 

• 9 /'GCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATATTTAACAA 3 08. 

" Tin || '| HIM I 111 III II ill I I M Ml 1 1 Ml I It II M I II II 

6^7 AGCTGATTTACAGCTCATTGACTTTGAGGGCAAAAAAGACGTGGCTCAAATTTTCAACAA 736'- 



Qy 
Db 



Db 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 857 



n 9 CATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGTGCTCATCC 3 6 8 

|| I II II I III II M II I I IIIIMIM II II II I I M I 

737 TATTCTCAGAAGACAAATTGGTACGAGAACTCCTACTGTTGAATACATCTGCACCCAACA 796- 
v 9 TCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTACGTTGTGG 428 

"[ Mil MM III I I Mill MUM I III Ml II II II MM 

7 97 GAATATTTTGTTCATGTTATTGAAAGGGTATGAATCTCCAGAAATAGCTCTAAATTGTGG 856- 
,oc. gattaTGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTCTTTTCTAA 4 88 

|| III I II Mill II III I II III I II II I Ml MM II II M I 

AATAATGTTAAGAGAATGCATCAGACATGAACCACTTGCAAAAATCATTTTGTGGTCGGA 916 
»9 TCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCTTCAGATGC 548- 

IM IIIIMIM III III I i I II 1 1 1 II 1 1 ! II I III 1 1 II I II 

1 7 ACAGTTTTATGATTTCTTCAGATATGTCGAAATGTCAACATTTGACATAGCTTCAGATGC 976 
5 ,-c CTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGACTTCTTAGA 608. 

"' Mill II II M II M I II I! I II IMIIIMI M I MM M M II 

Db 977 AT TGCCACATTCAAGGATTTACTTACAAGACATAAATTGCTCAGTGCAGAATTTTTGGA 1036 



Qy 

Db 



M9 — x f o r ™^ - 8 

103 7 iU^^ 1096 

(3 3 TACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAG^ ^28 

Qy 729 c~_^^ 788 

rb ■ iU^^ 1216 

1217 iiciiUUcGcii^ 1276 
OV .49 TCCTCACAAAACACAGCCTATTGTGGAGATCCTGTT 
. TCCTAACAAGACGCAGCCCATCCTAGAG 

909 GTTTCTGAGCAGCTTCCAAAAAGAAAGGACGGATGATGAGCAGT^ ?68 

,397 CTATTTAGTTAAACAGA^ "37 



Qy 

Db 

Qy 

Db 

Qy 

Db 

cy 

Db 

Qy 

Db i ■■?■'-'■/ 

Qy 

Db 



D' 



. Uii r 



6 8 H 

AAF30688 standard; cDNA; 1053 BP. 



VD 
XX 

AC A:^3.0b83;- - 
X". 



pnn. ii-JUN-2001 '(first entry) 

KW therapy; diagnosis; vaccine; ss. 

Q£ Homo sapiens . 

Kev Location/Qualifiers 
FT CDS 1..1053 
» T /*tag= a 

/products "Human ANIC-BP-1B" 

XX 

PN WO20012-5423-A1. 

XX 

FD 12-APR-2001. 
XX 



PF 28-S2P-2000; 2000WO-EP09475 . 
XX 

PR 04-OCT-1999; 99EP- 01 19113 . 
XX 

PA (MERE ) MERCK PATENT GMBH . 
XX 

PI Duecker K, Den Daas I; 
XX 
DR 



WPI; 2001-266306/27 . 
DR P-PSDB; AAB20387. 
XX 
PT 
PT 
PT 

PT cord injury - 
XX 

PS Claim 4; Page 43-44; 4 9pp; English. 

XX 

• CC 



CC 
CC 



cc 

CC 



Novel human acute neuronal induced calcium-binding protein like protein 
solice variant, useful for treating stroke, acute head trauma, 
Parkinson's disease, Alzheimer's disease multiple sclerosis, spinal 



The oresent sequence is that of cDNA encoding a novel human acute 
neuronal induced calcium binding protein-like protein splice 
variant, ANIC-NP-1B (see AAB20387) . The protein shows homology to • 
o^her members of the calcium binding protein family, including ^ ■ 
v.ntic-BP a protein discovered by mRNA differential display that is 
^regulated in a rat model of head trauma. ANIC-BP and ANIC-BP-1B ; 
'■"^-Ffer in their C- terminal portions. The variant protein could, 
v^ve as a novel drug target. The invention provides ANIC-BP.- IB r - 
polynucleotides and polypeptides, expression vectors, host cells 

-,h antibodies, as well as methods for producing the protein and .h:,.-. 
• ,r t rearing or preventing disorders associated with expression of - 
; proVein by inhibiting or activating the action of ANIC-BP -IB. <_ : .. 
• i,ease^ >bat may be treated include stroke and acute head trauma, ,;. 
ArkiirHon's disease, Alzheimer's disease , multiple sclerosis and 
Voinal ::ord injury. The polynucleotides and polypeptides can .ax so ... 
■ - used in diagnostic assays and in vaccines, and to identify 

agonists and antagonists useful for treating conditions associated 
: v/ith ANIC-BP- IB imbalance. 

I Sequence 1053 BP; 357 A; 211 C; 214 G; 271 T; 0 other; : 

Query Match 53.4%; Score 541.6; DB 22; Length 1053; - . * 

Best Local Similarity 74.1%; Pred. No. 1.5e-140; 

Matches. 716; Conservative 0; Mismatches 239; Indels 11; Gaps • 

18 GTTTAGTAAATCACACAAAAATCCAGCAGAAATtGTGAAAATCCTGAAAGACAATTTGGC 7 7 

I I | I j | || i | | || | | I II I I I I I I ! I I I 1 1 1 i I I I I 1 1 1 1 I Nil 
Db 12 GTTTGG 



Qy 



: G AAGTCTCACAAATCTCCAGCAGACATTGTGAAGAATCTGAAGGAGAGCATGGC 7 1 



7 8 CATTTTGGAAAAGCAAGAC AAAAAGACAGACAAGGCTTCAGAAGAAGTGTC 12 8 

ii run iii linn 1 1 1 1 1 1 1 1 1 i.i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 it 

72 TGTTCTGGAAAAGCAAGACATTTCTGATAAAAAAGCAGAAAAGGCTACAGAAGAAGTTTC 131 

1"9 TAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAACCCCCAAC 188 

mi in || llllllllllllllll III Mill II HIM ! I I II 

Db 132 CAAAAATCTGGTTGCCATGAAAGAAATTCTGTATGGCACAAATGAAAAAGAGCCTCAGAC 191 

ftv 189 AGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTGACACTGAT 248 

|| Ml Ml I Mill II II II II Mill Ml HIM II il M Ml I 



Qy 



Qy 



Db 

QY 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



192 AGAAGCAGTAGCTCAACTTGCTCAAGAACTCTATAATAGTGGGCTCCTTAGCACCCTGGT 251 

2 , 9 AGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCA^^ 308: 

30, catcttgagaagagagatagggag™ -b 

31? tattctcagaagacaaattggtacgagaa -i 

359 — c_^^ «• 

372 qaATATTTTGTTCATCT 431 

«9 GATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATC 4BB 

43? JWWJ^^ 491 

, 32 ACAGTTTTATGATTTCTTCAGATATGTCG 551 
, ■ "49 CTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGACTTCTTAGA 608 

" Ml II II M II I I I I I I I I I I II III IIIM •' M II.M II I' 

, . agaaaattaggacactattttxgaagagtatgagaaa™ • 
mi I I l I II I I I II I I 11 I I II ! I I I 1 S I I - 

Db ' .:. , IcAGCATTATGATAGATTTTTCAGTGAATATG^ ™ < ' 

Z ^ tactaagagacagtctttaaagctgctaggggagctg 728 ^ 

:,, - • , 7 , gaCAAAAAGACAGTCACTC ^ * 

^ . 72S CATCATGACAAAGTATATCAGCAAGCCGGAGAACC 788 

it i i it I M I II Mill I I I II I I I I I 11 11 

732 iUl G ACA^^ 791 

. 89 GGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTT^ 8*8 

HS , , I I 1 1 I 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 M M Mil 



Db 

Qy 

Db 



Db 

•Qy 
Db 



Ov 9GV AACTAC 972 

Ml 

Db ?7 2 GAATTC 97 7 



RESULT 13 
AAS^9557 



13-FE3-2002 (first entry) 



Human; chromosome mapping; gene mapping; gene therapy; forensic; 
food supplement; medical imaging; diagnostic; genetic disorder; ss. 



: P 



ID AAS39557 standard; cDNA; 1162 BP. 
XX 

AC AAS89557; 
XX 
DT 
XX 

DE DNA encoding novel human diagnostic protein #25361 
XX 
KW 
KW 
XX 

OS Homo sapiens . 
XX 

PN WO2GC175067 A2 . 

XX. 

PD ll-CCV-2001 . 
XX 

PF 30-MAR-2001; 2001WO-US08631-. 
XX 

?R 31-MAR-2000; 2000US-0540217 . 

PR 23-AUG-2000; 2 000US - 064 9167 . 
XX 

?A (HYSE-) HYSEQ INC. 

XX 

PI Drmanac RT, Liu C, Tang YT; 

XX 

WPI; 20C1-639362/73 . 

;;*> P-P3DB, ABG25370 . ^ 

XX • , _ _ / 

PT New isolated polynucleotide and encoded polypeptides, userul in 

diagnostics, forensics, gene mapping, identification of mutations 
responsible for genetic disorders or other traits and to assess 

:>: 1 : biodiversity - 

Claim 1; SEQ ID No 25361; 103pp; English. 

XX 

CC The invention relates to isolated polynucleotide (I) and 

CC polypeptide (II) sequences- (I) is useful as hybridisation probes, 

CC polymerase chain reaction (PCR) primers, oligomers, and for chromosome 

rc and gene mapping, and in recombinant production of (II). The 

CC polynucleotides are also used in diagnostics as expressed sequence tags 

CC for identifying expressed genes. (I) is useful in gene therapy techniques 

CC to restore normal activity of (II) or to treat disease states involving 

CC (II) . (II) is useful for generating antibodies against it, detecting or 

CC quantitating a polypeptide in tissue, as molecular weight markers and as 

-C a food supplement. (II) and its binding partners are useful in medical 

CC imaging of sites expressing (II). (I) and (II) are useful for treating 

CC disorders involving aberrant protein expression or biological activity. 

CC The polypeptide and polynucleotide sequences have applications in 

CC diagnostics, forensics, gene mapping, identification of mutations 

CC responsible for genetic disorders or other traits to assess biodiversity 

CC and to produce other types of data and products dependent on DNA and 

CC amino acid sequences. AAS64197-AAS94564 represent novel human 

CC diagnostic coding sequences of the invention. 

CC Note: The sequence data for this patent did not appear in the printed 

CC specification, but was obtained in electronic format directly from WIPO 

CC at ftp.wipo.int/pub/published_pct_sequences. 



XX 



SQ Sequence 1162 BP; 383 A; 241 C; 258 G; 280 T; 0 other; 

Query Match - 53.2%; Score 539.6; DB 23; Length 1162; 

Best Local Similarity 73.9%; Pred. No. 5.6e-140; 

Matches 743; Conservative 0; Mismatches 249; Indels 14; Gaps 

18 GTTTAGTAAA TCACACAAAAATCCAGCAGAAATTGTGAAAATCCTG^GAC^TTTGGC 77 

y INI I II II MINI II I Mill I II MINI I Mill II I 1 1 1 1 _ 

Db 143 GTTTGGGAAGTCTCACAAATCTCCAGCAGACATTGTGAAGAATCTGAAGGAGAGCATGGC 202- 

Ov -3 CATTTTGGAAAAGCAAGAC AAAAAGACAGACAAGGCTTCAGAAGAAGTGTC 12 8 • 

* || M I || || | | | I I II I I I I I I I I I I I I M I I I M M M I I M 

Db 203 TGTTCTCMAAAAGCAAGACATTTCTGATAAAAAAGCAGAAAAGGCTACAGAAGAAGTTTC 262 

-v • 9 TAAATCA^TGCA/.GCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAACCCCCAAC 1 8 8 

• m ill II Mill MM II Ml 1 1 III Mill II MM I - 

Db 263 CAAAAATCTGGTTGCCATGAAAGAAATTCTGTATGGCACAAATGAAAAAGATCCTCAGAC 322 

v g9 AG ^ GCAGTGGC TCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTGACACTGAT 248 ■ 

II I II II I II 1 1 1 M 1 1 M 1 1 II II 1 1 1 I 1 1 II I 1 1 II I 1 1 Mil 

523 AGAAGCAGGAGCTCAACTTGCTCAAGAACTCTATAATAGTGGGCTCCTTATCACGCTGGT 382' 
'-.•9 AGCTGACCTGCAGGTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATATTTAACAA 308- 

" ; ";|!||| | Mill II III MUM II Mill II I Ml I MM II I MM 

v, GC TGATTTACAGCTCATTGACTTTGAGGGCAAAAAAGACGTGGCTCAAATTTTCAACAA 442= 
,/,, ^ A TCTTGAGAAGACAGATAGGGA-CTCGGAGTCCTACTGTGGAGTATATTAGTG : eTCATG 367:. 

~|| | 1 1 1 1 1 1 M II M I ! M IIMIIlii II II M I ! 1 1 

4,3 TATTCTCAGAAGACAAATTGGTAGCGAGAACTCCTACTGTTGAATACATCTGCACCCAAA 502: 
.': - fi -'''TCA- - XATCGTGTTTATGCTCCTCAAAGGATATGAAGCCCCAGAGATT- -GCCTTfiCGT 423 

" [ i | MM III I I MIM Ml II I Ml M Mil 

„;>3 r;AGAATATTTTTGTTCATGTTATTGAAAGGGTATGAATCTCCCAGAAATAGCTCTAAATT 56*': 
..:."4 TGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCGTCTTT 4 83, 

^ him il Ml I I II Ml 1 1 II MUM II I Ml I M Ml II MM II 

563 tgTGGaAtAATGTTAAGAGAATGCATCAGACATGAACCACTTGGCAAAATCATTTTGTGG 522 

Ov 484 TCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCTTCA 543 

Q/ || | || || I I I I 1 II I I M M II II I I II I M II II II MMM 

nb • , S23 tc^aagAGTTTTATGATTTCTTCAGATATGTCGAAATGTC-AACATTTGACATAGCTTCA 6 82 

544 GA TGCCTTTGCTACTTTCAAGGATTTACTAACCAGACArAAAGTGTTGGTAGCAGACTTC 603 

i . : I I I I I I I I I I I I I I II I I I I I I I I 1 II 



Dl 



Db 



Db 

cy 

Db 

Qy 



S44 GATGCCniWlAU i LtmwH! i inv. . ~ .,,77 I Mill 

Mill Mill II I M 1 1 1 II II I M II I II II 1 1 II M I Mill , . 

Db ,63.3 GATGCATTTGCCAGATTCAAGGATTTACTTAGA 742- 

■ 604 TTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGAA^ 663: 

| | | | II I I I II M I MM II I Mill II M M M I M M I II 
7 ,V3 TTGGAACAGCATTATGATAGATTTTTCAGTGAATATGAG AAGTTACTTCATTCAGAAAAT 802, 

5 , v : TATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCACAAC 72 3 

'Ml! II I! Ml Mill I Mill II MM M Ml M I II Ml 



M M I I I I I I M I I II II M M M M II 

803 TiTGTGACAAAAAGACAGTCACTGAAGCTTCTCGGTGAACTACTACTAGATAGACACAAC 862 
7 24 TTTGCCATCATGACAAAGTATA.TCAGCAAGCCGGAGAACCTGAAACTCATGATGAACCTC 7 8 3 

II | || 1 1 M II II II Mill II M I M II 1 1 1 Ml I II II I ill II i 



Db 363 TTCACAATTATGACAAAATACATCAGTAAACCTGAGAACCTCAAATTAATGATGAACCTG 92 2 

Ov 7P* CTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTTGTG 84 3 

II II II II M I II IIIIIIIIMIIIM IMIMII 1 1 1 1 M 1 1 1 1 1 1 1 1 IM 

Db 923 CTGCGAGACAAAAGTCGCAACATCCAGTTTGAGGCCTTTCACGTTTTTAAGGTGTTTGTA 982 

rv 84- GCCAGTCCTCACAAAACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAACTC 903. 

' " MM MM MM II Mill II I M Mill I II II 1 1 1 II 1 1 1 1 1 1 

9 83 GCCAATCCTAACAAGACGCAGCCCATCCTAGACATC 1042 
904 ATTGAGTTTCTGAGCAGCTTCCAAAAAGAAAGGACGGATGATGAGCAGTTCGCTGACGAG 963 

M Mill II 1 1 M MM II II MINIM MINIM III NUN 

104 3 ATAGAGTTCCTCAGCAAGTTTCAGAACGACAGGACGGAGGATGAGCAGTTTAACGACGAG 1102 
964 AAGAACTACTTGATTAAACAGATCCGAGACTTGAAGAAAACGGCCC 100 9 

MM Ml II Ml I MINI I I II MINN I I N I 

1103 AAG AC C T ATTT AGTT AAAC AG AT C AGGG ATTTG AAG AG AC C AGC T C 114 8 



Db 
Qy 
Db 

Gy 
Db 



"RESULT 14 ' ' " 

AAX39818/C 

ID- AAX39818 standard; DNA; 833 BP. 

XX 

AC AAX3 9318; 

XX 

DT d:>-JV\i-:i99b (first entry) 

Gastric cancer associated gene. :>H 

Saucer associated antigen; diagnosis; research; treatment; human; 
breast cancer; colon cancer; gastric cancer; renal cancer; lung cancer, 
K$ prostate cancer; ss . • "? ■ • 

ci. Hemo sapiens. 

?N V.-O9.9042 65-A2 . 
XX 

PD 2Z -JAN- 1999. V . 



PF 17,-dUL - 1998 ; 98WO-US14 67 9 . 

XX . 

P-a 2-: -JUN -1998; 98US - 01023 22 . 

PR 17- JUL- 1997; 97US - 0896164 . 

^PR, 10rOCT-1997; 97US- 00615 99 . 

PY> 10- OCT -1997; 97US - 0061765 . 

PR 10- OCT- 1997; 97US - 094 87 05 . 

PR 11 OCT - 1997 ; 97GB-0021697 . 
XX 

PA . (LUDW-) LUDWIG INST CANCER RES. 

pr Chen Y, Gout I, Cure A, O'Hare M, Obata Y, Old L J; 

PI -Pfreur-dschuh M, Sahin U, Soanlan MJ, Stockert E; 

PI Turecl O; 
XX 

DR WPI; 1999-132448/11. 
XX 

PT New isolated cancer associated nucleic acids and polypeptides 



isolated using sera from cancer patients, used to develop products 
for the diagnosis, monitoring or treatment of cancers 



PT 

PT 

XX 

PS Claim 67; Page 559; 787pp; English 

XX 

cc 



Th» invention relates to a method for diagnosing a disorder <^acter»ed 
CC bv expression of a human cancer associated antigen precursor coded tor by 
CC a nucKic acid molecule (NAM). The method comprises: (a) contacting a 
CC biological sample isolated from a subject with an agent that specifically 
CC binds 9 to the NAM, an expression product or a fragment of an expression 
CC Product complexed with an HLA molecule; and (b) determining the 
CC Attraction between the agent and the NAM or the expression product as a 
CC determination of the disorder. The products and methods can be used m 
CC tne Sagnosis, monitoring, research, or treatment. of conditions _ 
cr characterised by the expression of various cancer associated antigens^ 
Te invention provides nucleic acid sequences and encoded polypeptides 
whicHre cancer associated antigen precursors ^^f^ 
cancer, renal cancer, colon cancer, gastric cancer, prostate cancer and 



CC 
CC 
CC 

CC lung cancer. 



XX 
SQ 



Sequence 333 BP; 253 A; 171 C; 172 G; 227 T; 10 other; 

Q Ma „ ch 51.3%; Score 52 0.2; DB 20; Length 833; 

P.est Local Similarity 98.1%; Pred. No. I.2e-134; _ _ 

retches 566; Conservative 0; Mismatches 7; indels 4; .aps 4, 

,42 GAATGTATTCGACATGAACCACTTG - CCAAAATC ATCCTC - ^T^T^^T'T?^?^? *" 

Till mil ill Ml Mill! Mill I Mill IMMIMMMIMM 

732 IaATNTATTCGACTTGACCCANT^ 
- ' atTTCTTTAAGT-ACGTGGAGTTGTCAACATTTGATATTGCTTCAGATGCCTTTGCTACT 558 ... 

672 IttTCTTTAAG^ 613 
559 TTCAAGGATTTACTAACCAGA-CATAAAGTGTTGGTAGCAGACTTCTTAGAACAA^ 617 

TuUNIIIIIIIIIII II I i I r M 1 1 1 1 i 1 1 M 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 

3b 612 ttcaaggatttactaaccnga 553 

rr a r a c^ ATTTTTGAAGACTATGAGAAATTGCTTC AGTCTGAGAATTATGTTACTAAG AG 677 
Qy 618 CGACA. ; ATTTT|GAA ACT ,,,, t ,,,,,,,,,,,, | , | | , | | ,, | ,,,, | , M | | | | " 

»■ ' 552 CGACACTATTTTTC 493 

•6-8 ACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCACAACTTTGCCATCATGAC 737 

Qy r 8 ntMIIMMMMIIIMIIIIIIIIIMIIIIIMMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

§ <_ )2 icAGTCTTTAAAGCTGCTAGGGGA^ ^ 
7,8 AAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAACCTCCTTCGGGATAAAAG 797 

mill II ill Mill MUM IMIIMIII II II II MM MIMI Mill Ml III 

432 AAAGTATATCAGCAAGCCGGAGAACCTC 373 
798 TCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTTGTGGCCAGTCCTCACAA 857 

T^T m 1 1 1 1 1 1 1 1 II! 1 1 1 ! 1 1 1 1 ! 1 1 1 1 i 1 1 III I M 1 1 II ! 1 1 1 1 1 1 M 1 1 1 1 1 1 1 

^ TCC AACATCCAGTTTGAAGCC 313 



Db 

Qy 
Db 



Db 
Qy 

Db. 



AACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAACTCATTGAGTTTCTGAG 917 
' M^ MM I I M I II I I I II I II I I I II II I II I M II II I II I M I I I I I II M I I! I I I 
J- 9 zJ^CACAGCCTATTGTGGAGATCC^ 253 



Qy 

Db 
Qy 

Db 



„„ GAGCTTCCAAAAAGAAAGGACGGATGATGAGCAGTTCGCTGAC^ ™ 
192 TAjJ^CAGATCCGAGACTTC 



RESULT 15 

AA338031 ^ „^ 

ID AAS88031 standard; cDNA; 2492 BP. 



XX 



XX 
DT 



AAS8iJ031; 

■15 --FEU -2 002 (first entry) 
BE ^ encoding novel human diagnostic protein #23835. 



XX 
KW 
KW 

XX 

OS Eoreo sapiens . , 
XX 

00 175-06 7 -A2 

XX 1- . 

:pd. ; jci:- 2001. : ■, 

"vJ^ •;, - VAR -2 001;. 200 1WO-US 0 8 6 3 i . . v ". 

1 -MAP -2 000; 2000US -0540217 . , t 

i3- AUG-2000 : 2 000US ; 064 9167 . 

XX • 
PA (HYSE-) HYSEQ INC. 

XX 

PI tfrmanac. RT, Liu C, Tang YT; 

XX 

r,o 2001-639362/73. 

Di> P-PSDB; ABG23844. 

S „.» isolated polynucleotide aed encoded P£Yg£^ Mutations 

PT biodiversity -. ■ 
XX 
PS 

XX i *- M iQolated polvnucleotide (I) and 

CC The invention relate, to xsolated jpoiy^ _ hvbridiBat ion probes, 

cc 
• cc 
cc 



cc 
cc 
cc 
cc 



Claim -i; GEQ ID No 23835; 103pp; English. 

i *-^r, iQol^ted Dolvnucleotide (I) and 
inV tide°(Ii? sequences (^^useful as hybridisation probes, 

fcS reaction (PGR) primers oligomers and for chromosome 
S/qene mapping, and in recombinant Product xon of g 
polynucleotides are also used in S J\ X fthera y techni q u e! 

for identifying ex P-sse dg e 3. I) xs useful .^.^ 

^.f 8 aJT iHsefui fot generating antibodies against it, detecting or 
q u anti LV peptide in tissue, as molecular weight markers and as 



CC a food supplement. (II) and its binding partners are useful m medical 
CC imaging of sites expressing (II). (I) and (II) are useful for treating 
CC disorders involving aberrant protein expression or biological activity. 
CC The polypeptide and polynucleotide sequences have applications in 
CC diagnostics, forensics, gene mapping, identification of mutations 
CT responsible for genetic disorders or other traits to assess biodiversity 
CC and to produce other types of data and products dependent on DNA and 
CC amino acid sequences. AAS64197-AAS94564 represent novel human 
CC diagnostic coding sequences of the invention. 

Note- The sequence data for this patent did not appear in the printed 
specification, but was obtained in electronic format directly from WIPO 



CC 

CC h- ■■— ~>- • — — 

CC at ftp.wipo.int/pub/published_pct_sequences 

XX 
SO 



Sequence 2492 BP; 751 A; 477 C; 546 G; 718 T; 0 other; 



Query Match 48.9%; Score 496; DB 23; Length 2492; 

Best Local Similarity 73.0%; Pred. No. l.le-127; 

Matches 737; Conservative 0; Mismatches 255; Indels 18; Gaps. /; 
^ in GTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATCCTGAAAGACAATTTGGC 77 

1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 I i 1 1 1 1 1 1 1 1 1 1 M 1 1 1 i 1 1 1 1 1 I M 1 1 1 1 , 

GTTTGGGAAGTCTCACAAATCTCCAGCAGACATTGTGAAGAATCTGAAGGAGAGCATGGC 202 
73 -ATTTTGGAAAAGCAAGAC AAAAAGAC AGAC AAGGC TTC AG AAG AAGTGT C 128 

"llllillllllillll Mill MM MM II I I III II I I Ml 

2 0 3 TGTTCTGGAAAAGCAAGACATTTCTGATAAAAAAGCAGAAAAGGCTACAGAAGAAGTTTC 2 6.2 



Db 143 
Ob 



• ■ 9 , : ,7 iAATCACTGCAAGCAA l i'GAAAGAAATTCTGTGTGGTACAAACGAGAAAGAACCCCCAAC: 188, 

'ill III ! I MIIIIMM I Mill III Mill II Mill M I il 

; : ; CAAAAATCTGGTTGCCATGAAAGAAATTCTGTATGGCACAAATGAAAAAGATCGTCAGAC 32:2- 
" 19 iyGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTGACACTGAT 24 8- 

' ■ IN M l|j Mill I MM I II MM II II I MM! II II I II III I 

Db 323 aGAAGCAGGAGCTCAACTTGCTCAAGAACTCTATAATAGTGGGCTCCTTATCACCCTGGT 3 82 

r, ->4 9 aGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATATTTAACAA 3 0.8 

y ~~ (Mill i mil ii i ii linn ii ii linn in i n n n inn 

Db 383 aGCTGATTTACAGCTCATTGACTTTGAGGGCAAAAAAGACGTGGCTCAAATTTTCAACAA 442 

nv ^0 9 CATCTTGAGAAGACAGATAGGCA-CTCGGAGTCCTACTGTGGAGTATATTAGTGCTCATC 3 67 

1 ' ii i mill ii ii ii i i i i ii iiiiii i ii ii ii i. i ii . 

Db , 43 t _ TTCTCAGAAGACAAAT tgGTACCGAGAACTCCTACTGTTGAATACATCTGCACCCAAA 502 

p - « , g 8 CTC A - - tATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATT - - GCCTTACGT 4 23 

i i i mi in i i inn iiiiii i ii ii ni i 

Db b03 CAGAATATTTTTGTTCATGTTATTGAAAGGGTATGAATCTCCCAGAAATAGCTCTAAATT 562 

nv *2A TGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTCTTT 4 83 

QY | | |M II III I I I I I I M I II III II III III II n 

Db 563 TGTGGAATAATGTTAAGAGAATGCATCAGACATGAACCACTTGGCAAAATCATTTTGTGG 62 2 

r „ a 34 TCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCTTCA 543 

1,7 ii inn inn iii i ii n n minii mil m iiiiii 

Db 62 3 TCGGAACAGTTTTATGATTTCTTCAGATATGTCGAAATGTCAACATTTGACATAGCTTCA 682 

c„ 5A4 GATGCC - TTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGACTT 602 

w ' | 1 1| I i I 1 1 M I II II M II II I III II M I 1 1 I M M I M 



Db 

Qy 

Db 



Db 683 GATGCCATTTGCCACATTCAAGGGTTTACTTACAAGACATAAATTGCTCAGTGCAGAATT 742 

743 TTTGGAACAGCATTATGATAGAT^ 802 
663 TTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCOT 722 

803 •» 

3,3 UIcaUJt^^ 922 
^ CTCCTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGG - TGTT 339 ■ 

uy mi i M ii miii ii nil i inn ii i ii inn nun in in 

,23 CTGCTGCGAGACAAAAGTCGCAACATC 982 
QY 840 TGTGGCCAGTCCTCACAAAACACAGCCTATTGTGGAGATCCT 899 

7 ii i 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 Ml II I II 1 1 1 1 

983 UUccUccUi^ 1042 
9 00 ACTCATTGAGTTTCTGAGCAGCTTCCAAAAAGAAAGGACGGA 9S9 

i i i i i I M I II II II I II I ! I M II N 

1043 A.CTCATAGAGTTCCTCAGC?^ 1102 
960 CGAGAAGAACTACTIGATTAAACAGATCCGAGACTTCJAAG 1009 

y i m m ii i n i n 1 1 1 1 n m n n n i i n i 

,,03 CGAGAAGACCTATTTAGTTA^ 1152 



Db 

Qy 



, Db 



completed: January 6, 2 004, 01:28:41 
v .ob "ime ; 3 94 isecs 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: January 6, 2004, 01:16:12 ; Search time 94 Seconds 

(without alignments) 
4761.303 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



US-10-088-872-1 
1014 

1 atgaaaaaaatgcctttgtt tgaagaaaacggccccttga 1014 

IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched : 



569978 seqs, 220691566 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1139956 



Database 



Issued_Patents_NA: * 

1 : /cgn2_6/ptodata/l/ina/5A_COMB. seq: * 

2 : /cgn2_6/ptodata/l/ina/5B_COMB.seq: * 

3 : /cgn2_6/ptodata/l/ina/6A_COMB. seq: * 

4 : /cgn2_6/ptodata/l/ina/6B_COMB. seq: * 

5 : /cgn2_6/ptodata/l/ina/PCTUS_COMB. seq: • 

6 : /cgn2_6/ptodata/l/ina/backf ilesl . seq: ' 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 





1 


1014 


100. 


0 


1421 


4 


US- 


09- 


620 


-312D-111 


Sequence 


111, App 




2 


1010. 8 


99. 


7 


1344 


3 


us- 


09- 


190 


-965-2 


Sequence 


2, Appli 




3 


1010.8 


99. 


7 


1344 


4 


us- 


09- 


470 


-253-2 


Sequence 


2, Appli 


c 


4 


51.6 


5. 


1 


7218 


1 


us- 


08- 


232 


-463-14 


Sequence 


14, Appl 




5 


37.8 


3. 


7 


1457 


4 


us- 


09- 


214 


-307A-9 


Sequence 


9, Appli 


c 


6 


37.4 


3. 


7 


4103 


4 


us- 


09- 


620 


-312D-390 


Sequence 


390, App 




7 


36.4 


3. 


6 


4533 


3 


us- 


08- 


726 


-214-5 


Sequence 


5, Appli 


c 


8 


35.6 


3. 


5 


577 


4 


us- 


09- 


513 


-057C-20 


Sequence 


20, Appl 


c 


9 


35.6 


3. 


5 


3707 


3 


us- 


09- 


276 


-531-42 


Sequence 


42, Appl 


c 


10 


35.6 


3. 


5 


5714 


4 


us- 


09- 


62 0 


-312D-393 


Sequence 


393, App 




11 


35.2 


3. 


5 


12793 


4 


us- 


09- 


004 


-838-124 


Sequence 


124, App 



1 





12 


35 


3. 


5 


775 


3 


us- 


08- 


961- 


083-89 


Sequence 


89, Appl 




13 


35 


3. 


5 


775 


4 


us- 


09- 


536- 


784-89 


Sequence 


89, Appl 




14 


35 


3. 


5 


1696 


3 


us- 


08- 


961- 


083-217 


Sequence 


217, App 




15 


35 


3. 


5 


1696 


4 


us- 


09- 


536- 


784-217 


Sequence 


217, App 




16 


35 


3. 


5 


3763 


1 


us- 


07- 


792- 


865D-1 


Sequence 


1, Appli 


c 


17 


35 


3. 


5 


13926 


4 


us- 


08- 


961- 


527-5 


Sequence 


5, Appli 


c 


18 


34. 8 


3. 


4 


1413 


4 


us- 


09- 


016- 


434-1382 


Sequence 


13 82, Ap 




19 


34. 8 


3. 


4 


12734 


4 


us- 


09- 


344- 


456-1 


Sequence 


1, Appli 




20 


34.8 


3. 


4 


14078 


3 


us- 


09- 


433- 


262-1 


Sequence 


1, Appli 




21 


34.8 


3. 


4 


14078 


4 


us- 


09- 


702- 


330-1 


Sequence 


1, Appli 




22 


34 . 8 


3. 


4 


14578 


3 


us- 


08- 


859- 


694-1 


Sequence 


1, Appli 


c 


23 


34 . 6 


3. 


4 


64467 


4 


us- 


09- 


803- 


671B-3 


Sequence 


3, Appli 




24 


34.4 


3. 


4 


22481 


4 


us- 


08- 


367- 


8 41A-4 3 


Sequence 


43, Appl 




25 


34.4 


3. 


4 


22481 


5 


PCT 


-US95-07201-43 


Sequence 


43, Appl 




26 


34.4 


3. 


4 


22484 


4 


us- 


09- 


875- 


223-2 


Sequence 


2, Appli 




27 


34 


3. 


4 


3378 


4 


us- 


09- 


328- 


352-4107 


Sequence 


4107, Ap 


c 


28 


34 


3. 


4 


43360 


4 


us- 


09- 


453- 


702B-206 


Sequence 


206, App 


c 


29 


34 


3. 


4 


45325 


4 


us- 


09- 


453- 


702B-261 


Sequence 


261, App 




30 


33. 8 


3. 


3 


189 


4 


us- 


09- 


134- 


001C-1145 


Sequence 


1145, Ap 


c 


31 


33.8 


3. 


3 


4233 


3 


us- 


09- 


056- 


105-27 


Sequence 


27, Appl 




32 


33.6 


3. 


3 


1276 


3 


us- 


09- 


177- 


325-2 


Sequence 


2, Appli 




33 


33. 6 


3. 


3 


1276 


3 


us- 


09- 


411- 


812A-2 


Sequence 


2, Appli 




34 


33. 6 


3. 


3 


1276 


4 


us- 


09- 


590- 


113-2 


Sequence 


2, Appli 




35 


33. 6 


3. 


3 


1347 


4 


us- 


09- 


134- 


001C-1309 


Sequence 


1309, Ap 




36 


33.4 


3. 


3 


1410 


4 


us- 


09- 


328- 


352-124 


Sequence 


12 4, App 


c 


37 


33.4 


3 . 


3 


1508 


1 


us- 


08- 


236- 


311-5 


Sequence 


5, Appli 


c 


38 


33.4 


3. 


3 


1508 


3 


us- 


08- 


457- 


-918-5 


Sequence 


5, Appli 




39 


33.4 


3. 


3 


2643 


4 


us- 


09- 


486- 


072-6 


Sequence 


6, Appli 


c 


40 


33.4 


3. 


3 


18627 • 


4 


us- 


08- 


-961- 


-527-113 


Sequence 


113, App 




41 


33.2 


3. 


3 


378 


4 


us- 


09- 


252- 


-991A-10313 


Sequence 


10313, A 


c 


42 


33.2 


3. 


3 


813 


4 


us- 


09- 


•252- 


-991A-10547 


Sequence 


10547, A 




43 


33.2 


3. 


3 


831 


4 


us- 


09- 


-252- 


-991A-10512 


Sequence 


10512, A 


c 


44 


33.2 


3. 


3 


3253 


4 


us- 


-09- 


■333- 


-214-4 


Sequence 


4, Appli 




45 


33.2 


3. 


3 


8091 


4 


us- 


■09- 


-230- 


-652-1 


Sequence 


1, Appli 



ALIGNMENTS 



RESULT 1 

US-09-620-312D-111 

Sequence 111, Application US/09620312D 
Patent No. 6569662 
GENERAL INFORMATION: 
APPLICANT: Tang, Y. Tom 
APPLICANT: Liu, Chenghua 
APPLICANT: Asundi, Vinod 
APPLICANT: Zhang, Jie 
APPLICANT: Ren, Feiyan 
APPLICANT: Chen, Rui-hong 
APPLICANT: Zhao, Qing A. 
APPLICANT: Wehrman, Tom 
APPLICANT : Xue, Aidong J. 
APPLICANT: Yang, Yonghong 
APPLICANT: Wang, Jian-Rui 
APPLICANT: Zhou, Ping 
APPLICANT: Ma, Yunqing 
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APPLICANT: Wang, Dunrui 
APPLICANT: Wang, Zhiwei 
APPLICANT: John Tillinghast 
APPLICANT: Drmanac, Radoje T. 

TITLE OF INVENTION: No. 6569662el Nucleic Acids and 
TITLE OF INVENTION: Polypeptides 
FILE REFERENCE: 784CIP2B 

CURRENT APPLICATION NUMBER: US/ 0 9/ 62 0 , 312D 
CURRENT FILING DATE: 2000-07-19 
PRIOR APPLICATION NUMBER: 09/552,317 
PRIOR FILING DATE: 2000-04-25 
PRIOR APPLICATION NUMBER: 09/488,725 
PRIOR FILING DATE: 2000-01-21 
NUMBER OF SEQ ID NOS : 1105 
SOFTWARE: pt_FL_genes Version 1.0 
SEQ ID NO 111 
LENGTH: 1421 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME/ KEY: CDS 
LOCATION: (217) . . (1230) 
US-09-620-312D-111 

Query Match 100.0%; Score 1014; DB 4; Length 1421; 

Best Local Similarity 100.0%; Pred. No. 4.8e-292; 

Matches 1014; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 AT GAAAAAAAT G C CT T T GT TT AGT AAAT C ACACAAAAAT CCAGC AGAAAT T GT GAAAAT C 60 

I I I I M I I I I i I I I I I II I II I 1 1 I I I.I I I 1 1 I 1 1 1 1 1 I I I I I I I I t I I I I I I I I I I I I i 

Db 217 AT GAAAAAAAT GC CTT T GT TTAGT AAAT C AC AC AAAAAT C CAGC AGAAAT T GT GAAAAT C 27 6 

Qy 61 C T G AAAG AC AAT T T G G C CAT T T T G G AAAAG C AAG AC AAAAAG AC AG AC AAG G C T T C AGAA 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 277 C T G AAAG AC AAT T T G G C CAT T T T G G AAAAG C AAG AC AAAAAG AC AG AC AAG G C T T C AG AA 33 6 

Qy 121 GAAGT GT CTAAAT CACT GCAAGCAAT GAAAGAAATT CT GT GTGGT ACAAACG AGAAAGAA 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 1 I I I I I I I I II I II I I I I I I I I I 

Db 337 GAAGT GT C T AAAT C AC T G C AAG C AAT G AAAGAAAT T CT GT GT G GT AC AAAC GAG AAAG AA 396 

Qy 181 CCCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 397 C C C C C AAC AGAAG C AGT G G CT CAG CT AGCACAAGAACT CT AC AG C AGT G G C CT GCT AGT G 4 56 

Qy 241 AC ACT GAT AGCT GAC CT GCAGCT GATAGACTTTGAGGGAAAAAAAGAT GT GACC CAGAT A 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 457 ACACT GAT AGCT GACCT GCAGCTGATAGACTTT GAGGGAAAAAAAGAT GT GAC C CAGAT A 516 

Qy 301 T TT AACAACAT CTT GAGAAGACAGATAGGCACT CGGAGT CCTACT GT GGAGTAT ATTAGT 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 517 T TTAACAACAT CTT GAGAAGACAGATAGGCACT CGGAGT CCTACT GT GGAGTAT ATT AGT 576 

Qy 361 GCTCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 577 GCT CAT C C T CAT AT C CT GTTT AT GCT C CT C AAAGGAT AT GAAG C C C C AC AG AT T GC CT T A 636 

Qy 421 C GT T GT G G GAT TAT GCT GAGAGAAT GT ATT CGAC AT GAACC AC T T G C C AAAAT CAT C CT C 4 80 
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Db 637 C GT T GT GG GAT TAT GC T GAGAGAAT GT ATT C GAC AT GAAC CAC T T GC C AAAAT CAT C CT C 696 

Qy 481 TTTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCT 54 0 

I I I I I II I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I 
Db 697 TTTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTC7U\CATTTGATATTGCT 756 

Qy 541 T C AGAT G C C T T T GC T AC T T T C AAG GAT T T ACT AAC C AGAC AT AAAGT GT T GGT AG C AGAC 600 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 

Db 757 T C AG AT GCCTTTGC T ACT T T C AAG GAT T T AC T AAC C AGAC AT AAAGT GT T G GT AG C AGAC 816 

Qy 601 T T CT TAGAAC AAAAT T AC GAC ACT AT T T T T GAAGACT AT G AG AAAT T GC T T C AGT CT GAG 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 817 T T CT TAGAAC AAAAT T AC GAC ACT ATT T T T GAAGAC TAT G AGAAAT T GC T T C AGT CT GAG 87 6 

Qy 661 AAT TAT GT T AC T AAGAGAC AG T C T T T AAAG C T G C T AG G G GAG C T GAT C C T G GAC C G T CAC 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 877 AATTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCAC 936 

Qy 721 AACTTT GCCAT CAT GAC AAAGT AT AT CAGCAAGC CGGAGAAC CT GAAACT CAT GAT GAAC 780 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I 

Db 937 AACTTTGCCAT CAT GACAAAGTATATCAGCAAGC CGGAGAAC CT GAAACT CAT GATGAAC 996 

Qy 7 81 CTCCTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTT 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 997 CTCCTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTT 1056 

Qy 841 GTGGCCAGTCCTCACAAAACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAA 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 1057 GTGGCCAGTCCTCACAAAACACAGCCTATTGTGGAGATCCTGTTAAA/NAATCAGCCCAAA 1116 

Qy 901 C T CAT T GAGT T T C T GAGCAGC T T C CAAAAAG AAAGGAC G GAT GAT GAG C AGT T C G CT GAC 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I II I I I I II I I 

Db 1117 C T CAT T GAGT T T C T GAG CAG CT T C CAAAAAG AAAGGAC G GAT GAT GAG C AGT T C G CT GAC 117 6 

Qy 961 GAGAAGAAC T ACT T GATT AAACAGATC C GAGAC T T GAAG AAAAC GGCCCCTT G A 1014 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 1177 GAGAAGAACT AC T T GATT AAACAGAT C C GAGAC T T GAAG AAAAC GGCCCCTT GA 1230 



RESULT 2 
US-09-190-965-2 

; Sequence 2, Application US/09190965 

; Patent No. 6071721 

; GENERAL INFORMATION : 

; APPLICANT: Tang, Y. Tom 

; APPLICANT: Guegler, Karl J. 

; APPLICANT: Corley, Neil C. 

; APPLICANT: Gorgone, Gina A. 

; TITLE OF INVENTION: CALCIUM BINDING PROTEIN 

FILE REFERENCE: PF-0635 US 
; CURRENT APPLICATION NUMBER: US/09/190,965 
; CURRENT FILING DATE: 1998-11-13 
; NUMBER OF SEQ ID NOS : 5 
; SOFTWARE: PERL Program 
; SEQ ID NO 2 

LENGTH: 1344 
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TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE: - 

OTHER INFORMATION: 3734805 
US-09-190-965-2 

Query Match 99.7%; Score 1010.8; DB 3; Length 1344; 

Best Local Similarity 99.8%; Pred. No. 4.2e-291; 

Matches 1012; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

AT G AAAAAAAT G C CT T T GT T T AGT AAAT C ACACAAAAAT C CAG C AG AAAT T GT G AAAAT C 60 
I | | | | I I I I I I I I II I t I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

AT G AAAAAAAT GCCTTTGTT T AGT AAAT C AC ACAAAAAT C CAG CAG AAAT T GT G AAAAT C 183 

C T GAAAGAC AAT T T G G C CAT T T T G G AAAAG C AAG AC AAAAAG AC AG AC AAG G C T T C AG AA 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 

C T GAAAGAC AAT T T G G C CAT T T T G G AAAAG C AAG AC AAAAAG AC AG AC AAG G C T T C AG AA 24 3 
GAAGT GT CTAAAT CACT GCAAGCAAT GAAAGAAATT CT GT GT GGT ACAAACGAGAAAGAA 180 

I I I i I I I I I I I M I I I i I I I I I I I I I I I I I I M I I I ! I I I > I I I I I I I I I I I I 

GAAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGT ACAAACGAGAAAGAA 303 

CCCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 24 0 

Mill I I I I I I I I I I 1 i I I I I I I I I I I I I I M I I I I I I I I I i i I I I I I I I I I I I I III 

CCCCCGACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTGGTG 363 

AC ACT GAT AG C T GAC C T G CAG C T GAT AG AC T T T GAG G GAAAAAAAG AT GT G AC C CAG AT A 300 
| | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
AC ACT GAT AGC T GAC CT GCAGCT GAT AG ACT T T GAG G GAAAAAAAG AT GT GAC C C AGAT A 42 3 

TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGT 360 

I I 1 I I I I M I I I I I II I I ! 1 I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I.I I I I I I 

T T T AAC AAC AT C T T G AGAAGAC AGAT AG G C AC T C G G AGT C C T AC T GT G G AGT AT AT T AGT 4 8 3 

GCT CAT C CT C AT AT C CT GTTT AT GCT C CT CAAAGGAT AT GAAG C C C C AC AGAT T GC CT T A 420 
I I I I I I I I I I 1 i I I I I I I ! I II I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GCTCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 543 

C GT T GT GG GAT TAT GCT GAGAGAAT GT AT T C GAC AT GAAC C ACT T G C C AAAAT CAT C C T C 4 80 
I | | | | | I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

C GT T GT G GGAT TAT GCT GAGAGAAT GT AT T C GAC AT GAAC C AC T T GC C AAAAT CAT C CT C 603 

T T T T CT AAT C AAT T C AGAG AT T T CT T T AAG T AC GT G G AGT T GT C AAC AT T T GAT AT T G C T 54 0 

I I I II II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I i II I I I M I I I I I I I 

TT T T CT AAT CAATT C AGAGAT T T C T T T AAGT AC GT G GAGTT GT CAAC AT T T GAT AT T GCT 663 
TCAGATGCCTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGAC 600 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

T C AGAT G C CT T T GCT ACTT T C AAG GAT T T AC T AAC C AGACAT AAAGT GT T GGT AGCAGAC 723 
TTCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAG 660 

1 1 1 I i 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I i I I 1 1 1 I i I I 1 1 I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

T T C T TAG AAC AAAAT T AC GAC AC TAT T T T T GAAG AC TAT GAG AAAT T GC T T C AGT CT GAG 783 

AAT TAT GT TACT AAG AG AC AGT CT T T AAAG C T GCT AGGGGAGCT GAT C CT GGAC C GT C AC 720 
I I J I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I M I I I I M I 
AATTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCAC 843 



Qy 


i 


Db 


124 


Qy 


61 


Db 


184 


Qy 


121 


Db 


244 


Qy 


181 


Db 


304 


Qy 


241 


Db 


364 


Qy 


301 


Db 


424 


Qy 


361 


Db 


484 


Qy 


421 


Db 


544 


Qy 


481 


Db 


604 


Qy 


541 


Db 


664 


Qy 


601 


Db 


724 


Qy 


661 


Db 


784 
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Qy 721 AAC T T T G C CAT CAT G AC AAAGT AT AT C AG C AAG C C G GAG AAC C T G AAAC T CAT GAT G AAC 7 8 0 

II II ! I I I I I I I i I I I I M I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 844 AAC T T T GC C AT CAT GAC AAAGT AT AT C AGC AAG C C GGAGAAC CT G AAAC T CAT GAT GAAC 903 

Qy 781 CTCCTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTT 84 0 

I || I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 904 CTCCTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTT 963 

Qy 841 GTGGCCAGTCCTCACAAAACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAA 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I M I I I I I I I I I I I I I I I I M I I I I I I I 

Db 964 GT GGCC AGT C CT C AC AAAACAC AGC CT AT TGT GGAGAT C CT GT T AAAAAAT CAGC C C AAA 1023 

Qy 901 C T CAT T G AGT T T C T GAG C AG C T T C C AAAAAG AAAG GAC G GAT GAT GAG C AGT T C G C T GAC 960 

I I I I I I I I I I I I I I ! I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1024 C T CAT T G AGT T T C T G AG C AG C T T C C AAAAAG AAAG GAC G GAT GAT GAG C AGT T C G C T GAC 1083 

Qy 961 GAGAAGAAC TACT T GAT TAAACAGAT C CGAGACTT GAAGAAAACGGC C C CT T GA 1014 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I 

Db 1084 GAGAAGAAC T AC T T GAT TAAACAGAT C C GAGACTT GAAGAAAAC GGCCCCTT GA 1137 



RESULT 3 

US-09-470-253-2 

; Sequence 2, Application US/09470253 

; Patent No. 6365371 

; GENERAL INFORMATION: 

; APPLICANT: Tang, Y. Tom 

; APPLICANT: Guegler, Karl J. 

; APPLICANT: Corley, Neil C. 

APPLICANT: Gorgone, Gina A. 
; TITLE OF INVENTION: CALCIUM BINDING PROTEIN 
; FILE REFERENCE: PF-0635 US 

CURRENT APPLICATION NUMBER: US/09/470 , 253 
; CURRENT FILING DATE: 1999-12-22 

PRIOR APPLICATION NUMBER: 09/190,965 
PRIOR FILING DATE: 1998-11-13 
; NUMBER OF SEQ ID NOS : 5 
; SOFTWARE: PERL Program 
; SEQ ID NO 2 

LENGTH: 1344 

TYPE: DNA 
; ORGANISM: Homo sapiens 
; FEATURE: - 

OTHER INFORMATION: 3734805 
US-09-470-253-2 

Query Match 99.7%; Score 1010.8; DB 4; Length 1344; 

Best Local Similarity 99.8%; Pred. No. 4.2e-291; 

Matches 1012; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy 1 AT GAAAAAAAT GCCTTTGTT T AGT AAAT CAC AC AAAAAT C C AGCAGAAAT T GT GAAAAT C 60 

I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I 1 I I 
Db 124 AT GAAAAAAAT GCCTTTGTT T AGT AAAT CAC AC AAAAAT C C AG C AG AAAT T GT GAAAAT C 183 

Qy 61 C T G AAAG AC AAT T T G G C CAT T T T G G AAAAG C AAG AC AAAAAG AC AG AC AAG G C T T C AG AA 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 
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Db 184 CT GAAAGAC AAT T T G G C CAT T T T GGAAAAGCAAGAC AAAAAGAC AGAC AAG G C T T CAGAA 24 3 

Qy 121 GAAGT GT CT AAAT CACT GCAAGCAAT GAAAGAAATT CTGT GT GGT ACAAAC GAGAAAGAA 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 244 GAAGT GT CT AAAT C AC T G C AAG C AAT GAAAGAAAT TCTGTGTG GT AC AAAC GAGAAAGAA 303 

Qy 181 CCCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 24 0 

I I I I I I I I I I I I I If I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I Ml 

Db 304 CCCCCGACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTGGTG 363 

Qy 241 AC ACT GAT AG CT GAC C T GCAG CT GAT AGACTTT GAGGGAAAAAAAGAT GT GAC C CAG AT A 300 

I I I I I I I I t I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 364 AC ACT GAT AG C T GAC C T G CAG C T GAT AG AC T T T G AGG G AAAAAAAGAT GT GAC C CAG AT A 423 

Qy 301 TT TAAC AACAT CTT GAGAAGACAGAT AGGCACT C GGAGT C CT ACT GT GGAGT AT ATTAGT 360 

I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I i I I I I 

Db 424 T T TAAC AACAT CTT G AGAAG AC AG AT AGGCAC T C GGAGT C CT AC T GT G GAGT AT AT T AGT 4 83 

Qy 361 GCT CAT C CT CAT AT C CT GT T T AT GC T C CT C AAAGGAT AT GAAGCC C C AC AG AT T GC CT T A 420 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 4 84 GCT CAT C C T CAT AT C CT GT T TAT G CT C CT CAAAGGAT AT GAAGCC C CAC AGAT T GC C T T A 54 3 

Qy 421 C GT T GT GGGATT AT GCT GAGAGAAT GTATTC GACAT GAACCACTT GCC AAAAT CAT C CT C 4 80 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 544 CGTTGTGG GAT TAT GCT GAGAGAAT GT AT T C GACAT G AAC CAC T T G C C AAAAT CAT C C T C 603 

Qy 4 81 TTTTCT7VATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTC7UVCATTTGATATTGCT 54 0 

I I I I I I I I I I I I I I I I II I 1 I I I I I I I I I II I I I I I I I II I I I I I I I I I I I II I I 

Db 604 TT TT CTAAT CAAT T CAGAGAT T T CT T T AAGT AC GT GGAGT T GT CAAC AT T T GAT AT T G CT 663 

Qy 541 T CAGAT G C C T T T GC T ACT T T C AAGGAT T T ACT AACCAGAC AT AAAGT GT T G GT AG C AGAC 600 

I | II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I II I I I I I I II I I I I 

Db 664 T CAGAT GCCTTTGC T AC T T T C AAG GAT T TACT AAC CAG AC AT AAAGT G T T G G T AG CAG AC 72 3 

Qy 601 T T C T TAG AAC AAAAT T AC GAC AC TAT T T T T G AAG AC T AT GAG AAAT T G C T T C AGT C T GAG 660 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I ! II I I I I I I 

Db 724 TT CT TAGAAC AAAAT T AC GAC AC TAT T T TT G AAGACT AT GAGAAAT T GC T T CAGT CT GAG 783 

Qy 661 AATTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCAC 72 0 

I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 784 AATTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCAC 84 3 

Qy 721 AACTTT GC CAT CAT GAC AAAGT AT AT CAGC AAGCC GGAGAAC CT GAAACT CAT GAT GAAC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 844 AACTTTGCCAT CAT GACAAAGTATAT CAGCAAGC CGGAGAACCTGAAACT CAT GAT GAAC 903 

Qy 7 81 CTCCTTCGGGATAAAAGTCCCAACATCCAGTTTG7\AGCCTTTCATGTTTTTAAGGTGTTT 84 0 

I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 904 CTCCTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTT 963 

Qy 841 G T G G C CAG T C C T C AC AAAAC AC AG C C TAT T GT G GAG AT C C T G T T AAAAAAT CAG C C C AAA 90 0 

I I I I I I I I I I I I I I II I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 

Db 964 GT GGC C AGT C C T CAC AAAAC AC AG C C TAT T GT G GAG AT C C T GT T AAAAAAT CAG C C C AAA 102 3 

Qy 901 C T CAT T GAGT TTCTGAGCAGCTTC C AAAAAG AAAG GAC G GAT GAT GAG C AGT T C G C T GAC 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I M I 

Db 1024 CT C ATT GAGT T T CT GAGCAG CT T C CAAAAAGAAAG GAC GGAT GAT GAGC AGT T C GC T GAC 108 3 
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Qy 961 G AG AAG AAC T AC T T GAT T AAAC AG AT C C GAG AC T T G AAGAAAAC GGCCCCTT G A 1014 

I I i I I I I I I I I I I II I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 

Db 1084 GAG AAG AACT AC T T GAT T AAAC AG AT C C GAG AC T T G AAGAAAAC GGCCCCTT G A 1137 



RESULT 4 

US-08-232-463-14/c 

; Sequence 14, Application US/08232463 

; Patent No. 5670367 

; GENERAL INFORMATION: 

APPLICANT: DORNER, F. 
APPLICANT: SCHEIFLINGER, F. 
; APPLICANT: FALKNER, F. G. 

TITLE OF INVENTION: RECOMBINANT FOWLPOX VIRUS 
NUMBER OF SEQUENCES: 52 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Foley & Lardner 

STREET: 1800 Diagonal Road, Suite 500 
; CITY: Alexandria 

STATE: VA 
COUNTRY: USA 
ZIP: 22313-0299 
; COMPUTER READABLE FORM: 

; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/232 , 4 63 
FILING DATE: 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/ 07/ 935 , 313 
; FILING DATE: 

APPLICATION NUMBER: EP 91 114 30 0.6 
FILING DATE: 26-AUG-1991 
ATTORNEY/ AGENT INFORMATION: 
; NAME: BENT, Stephen A. 

REGISTRATION NUMBER: 29,768 
REFERENCE/ DOCKET NUMBER: 30472/114 IMMU 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (703)836-9300 
TELEFAX: (703)683-4109 
TELEX: 899149 
INFORMATION FOR SEQ ID NO: 14: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 7218 base pairs 

; TYPE: nucleic acid 

STRANDEDNESS : single 
; TOPOLOGY: linear 

IMMEDIATE SOURCE: 
; CLONE: pTZgpt-Fls 

US-08-232-463-14 

Query Match 5.1%; Score 51.6; DB 1; Length 7218; 

Best Local Similarity 3.6%; Pred. No. 4.2e-05; 
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Matches 12; Conservative 196; Mismatches 130; Indels 0; Gaps 0; 

Q y l AT GAAAAAAAT GCCTTTGTT T AGT AAAT C AC ACAAAAAT C CAGCAGAAATT GT GAAAAT C 60 

| | | | | | | MMI : : : : ::::::: ::::::: :::::: 
Db 14 56 AAGAGATAGAAGAATTTGGTACRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1397 

Qy 61 C T GAAAGACAAT T T GG C CAT T T T G G AAAAG C AAGAC AAAAAGAC AGAC AAG G C T T C AGAA 120 

Db 1396 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR RR 1337 

Qy 121 GAAGT GT CT AAAT CACT GC AAGCAAT GAAAGAAATT CT GT GT GGTACAAACGAGAAAGAA 18 0 

*■•• • ... • - ... ........ » . . ■ • • 

.... . ... . . ... .. ........ . . •• 

Db 1336 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1277 

Qy 181 CCCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 24 0 

Db 127 6 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 121 7 

Qy 241 AC AC T GAT AG C T GACCT GC AGC T GAT AGAC T T T GAG G GAAAAAAAG AT GT GACCC AGAT A 300 

Db 1216 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR H^ 7 

Qy 3 01 TT T AAC AAC AT CT T GAGAAGACAGAT AGGC ACT C GGAG 338 

Db 1156 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR HI 9 

RESULT 5 

US-09-214-307A-9 

Sequence 9, Application US/09214307A 
Patent No. 6544516 
GENERAL INFORMATION: 
APPLICANT: NEUTEC PHARMA PLC 

TITLE OF INVENTION: TREATMENT AND DIAGNOSIS OF INFECTIONS OF GRAM POSITIVE 
TITLE OF INVENTION: COCCI 
FILE REFERENCE: PM 259204 

CURRENT APPLICATION NUMBER: US/09/2 14 , 307A 
CURRENT FILING DATE: 1999-01-04 
PRIOR APPLICATION NUMBER: PCT/GB97/ 01 8 30 
PRIOR FILING DATE: 1997-07-07 
PRIOR APPLICATION NUMBER: GB9614274.0 
PRIOR FILING DATE: 1996-07-06 
NUMBER OF SEQ ID NOS : 15 
SOFTWARE: Patent In Ver. 2.1 
SEQ ID NO 9 
LENGTH: 14 57 
TYPE: DNA 

ORGANISM: Staphylococcus aureus 
US-09-214-307A-9 

Query Match 3.7%; Score 37.8; DB 4; Length 1457; 

Best Local Similarity 47.0%; Pred. No. 0.23; 

Matches 150; Conservative 0; Mismatches 167; Indels 2; Gaps 1; 

Qy 4 30 ATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTCTTTTCTAAT 4 89 

I | M I I II ! I I I I I ID I I I 1 I I I I I M 

Db 38 6 AAT AT GAGAACT GT AGT T GAT C GAC CT AG AAC AC AAT AT AAAAAAGT CGT CT T T AAT AAT 4 45 
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Qy 4 90 CAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCTTCAGATGCC 54 9 

III III I I I I I I III II I I I I I I II 

Db 446 TTATTTTATCAATTTAGTAAGGATGCCAACTTTGAACCTATTGCTTGTAGACCCTATCGT 505 

Qy 550 TTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGACTTCTTAGAA 609 

III I I I I I I I I I I I I I I I I I I 

Db 5 06 C CT CAAAC AAAAG G GT CT GT T G AAT CAT T AG CT AAAT T T GT T G AAC AGC GT T T AAGAC C A 565 

Qy 610 C AAAAT T AC G AC AC TAT T T T T GAAGAC TAT G AGAAAT T G C T T C AG T C T G AGAATT AT GT T 669 

I I I I I I I I I II I II I I I I I III I I I I I I I I 

Db 5 66 T AC GAT TAT G AAT T T TAT GAT G C T G — TAGAACTTATTGGGCTAGTAAACGATTTATGTC 623 

Qy 67 0 ACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCACAACTTTGCC 729 

MM II III M II I I I II I I I I I I I I I I 

Db 62 4 AC GAAT T GAAT C ACT T AGAAAT T T C ACAAGC AAC AGAAC AAC GAC CT AT C GAC GT T T T C A 683 

Qy 7 30 AT CAT GACAAAGT AT AT C A 74 8 

I I II II II I I II 

Db 68 4 ATTATGAAGAAAAAGAACA 7 02 



RESULT 6 

US-09-62 0-312D-390/C 

Sequence 390, Application US/09620312D 
Patent No. 6569662 
GENERAL INFORMATION: 
APPLICANT: Tang, Y. Tom 

Liu, Chenghua 



APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
TITLE OF INVENTION: 
TITLE OF INVENTION: 



Asundi, Vinod 
Zhang, Jie 
Ren, Feiyan 
Chen, Rui-hong 
Zhao, Qing A. 
We h r ma n , T om 
Xue, Aidong J. 
Yang, Yonghong 
Wang, Jian-Rui 
Zhou, Ping 
Ma, Yunqing 
Wang, Dunrui 
Wang, Zhiwei 
John Tillinghast 
Drmanac, Radoje T. 

No. 6569662el Nucleic Acids and 
Polypeptides 



FILE REFERENCE: 784CIP2B 

CURRENT APPLICATION NUMBER: US/09/620, 312D 
CURRENT FILING DATE: 2000-07-19 
PRIOR APPLICATION NUMBER: 09/552,317 
PRIOR FILING DATE: 2000-04-25 
PRIOR APPLICATION NUMBER: 09/488,725 
PRIOR FILING DATE: 2000-01-21 
NUMBER OF SEQ ID NOS : 1105 
SOFTWARE: ptFLgenes Version 1.0 
SEQ ID NO 390 
LENGTH: 4103 
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TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME/ KEY : CDS 
LOCATION: (104) . . (3493) 
US-09-620-312D-390 

Query Match 3.7%; Score 37.4; DB 4; Length 4103; 

Best Local Similarity 60.2%; Pred. No. 0.53; 

Matches 62; Conservative 0; Mismatches 41; Indels 0; Gaps 0; 

Qy 6 AAAAAT GCCT TT GTTT AGT AAAT CACACAAAAAT CCAGCAGAAATTGT GAAAAT C CT GAA 65 

I I I I I I I I I I I I I I I I I I I 1 M I I I II M M I II 

Db 4091 ACAAAT GAGAAAGTTT CATTTAC CT CAAAAAAAT CCAGGCT AT ACAAAC AGACAACT GAA 4032 

Qy 66 AGACAAT T T GGC C AT T T T GGAAAAG CAAGAC AAAAAGAC AGAC 108 

I I I I III I I I I I I II I I I I I I I I I I I II 
Db 4 031 AG C CAC AT AG GAAAT T T C C GAAAC ACAAAAGAAAAAGT CT CAC 398 9 



RESULT 7 
US-08-726-214-5 

; Sequence 5, Application US/08726214 

; Patent No. 6107076 

; GENERAL INFORMATION: 

APPLICANT: Tang, Wei- Jen 

APPLICANT: Gilman, Alfred G. 

TITLE OF INVENTION: SOLUBLE MAMMALIAN ADENYLYL CYCLASE 
TITLE OF INVENTION: AND USES THEREFOR 
NUMBER OF SEQUENCES: 31 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Arnold, White & Durkee 

; STREET: P.O. Box 4433 

; CITY: Houston 

; STATE: Texas 

COUNTRY: United States of America 
ZIP: 77210 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/726,214 
FILING DATE: Concurrently Herewith 
; CLASSIFICATION: 4 35 

PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 60/005,498 
FILING DATE: 04-OCT-1995 
ATTORNEY/AGENT INFORMATION: 
; NAME: Highlander, Steven L. 

REGISTRATION NUMBER: 37,642 
REFERENCE/ DOCKET NUMBER: UTSD:450 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (512) 418-3000 

TELEFAX: (512) 474-7577 
INFORMATION FOR SEQ ID NO: 5: 
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SEQUENCE CHARACTERISTICS: 
LENGTH: 4533 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
US-08-726-214-5 

Query Match 3.6%; Score 36.4; DB 3; Length 4533; 

Best Local Similarity 56.8%; Pred. No. 1.1; 

Matches 67; Conservative 0; Mismatches 51; Indels 0; Gaps 0; 

Qy 718 C AC AAC T T T GC C AT CAT G ACAAAGT AT AT CAGCAAG C C G GAGAAC C T GAAACT C AT GAT G 777 

I I I I I I II I I I 1 II I I I I I I I I I I I I I I I I I 1 I I II I I 

Db 2 64 4 C T CAT C GC CAC CAT CAT G CT G GT G CAG GT CAG C C AC AT G GT GAAGC T G AC AC T CAT G C T G 27 03 

Qy 77 8 AACCTCCTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGG 835 

III II I I I I I I I I I I I I I I II III III I I 

Db 2704 CTCGTCACAGGCGCCGTGACTGCCATCAACCTGTATGCCTGGTGTCCTGTCTTTGATG 2761 



RESULT 8 

US-09-513-057C-2 0/c 

Sequence 20, Application US/09513057C 
Patent No. 6433251 
GENERAL INFORMATION: 
APPLICANT: Wagner, et al. 

TITLE OF INVENTION: GENES REGULATING CIRCADIAN CLOCK FUNCTION AND 
PHOTOPERIODISM 

FILE REFERENCE: 1505-54357 

CURRENT APPLICATION NUMBER: US/ 09/ 513 , 057C 
CURRENT FILING DATE: 2000-02-24 
NUMBER OF SEQ ID NOS : 35 
SOFTWARE : Patentln version 3.1 
SEQ ID NO 20 
LENGTH: 577 
TYPE: DNA 

ORGANISM: Lycopersicon esculentum 
US-09-513-057C-20 

Query Match 3.5%; Score 35.6; DB 4; Length 577; 

Best Local Similarity 51.2%; Pred. No. 0.64; 

Matches 83; Conservative 0; Mismatches 79; Indels 0; Gaps 0; 

Qy 4 57 GAACCACTTGCCAAAATCATCCTCTTTTCTAATCAATTCAGAGATTTCTTTAAGTACGTG 516 

I I I I I I I I I II I I Mill I I Ml I II I I I I II I I I 

Db 223 GAC C C AAAT AC C C AAAAC AC AAT CT T T AC AT AGAAAT C AAG AGAT T T CT GAAGC AC AC AG 164 

Qy 517 GAGTTGTCAACATTTGATATTGCTTCAGATGCCTTTGCTACTTTCAAGGATTTACTAACC 57 6 

II I I I I I I I I I III II I II Ml I II II 

Db 163 AAG C AAAAAAGAT GT AT AAT T T C AC AAAAT T AC TAT T AT AT T T T T CT GT GAT CAT GT AAC 104 

Qy 577 AGAC AT AAAGT GT T G GT AG C AGAC T T CTTAGAAC AAAAT T AC 618 

I I I I II I I I I I I M I I I I I I I 

Db 103 AG GC CTT GTT GGT AAG C ACAAT AAT AT GAAGAAAG AG AT T AC 62 



RESULT 9 
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\ 

US-09-276-531-42/c 

Sequence 42, Application US/09276531 
Patent No. 6183968 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Lai, Preeti 
APPLICANT: Hillman, Jennifer L. 
APPLICANT : Yue, Henry 
APPLICANT: Reddy, Roopa 
APPLICANT: Guegler, Karl J. 
APPLICANT: Baughn, Marian R. 

TITLE OF INVENTION: COMPOSITION FOR THE DETECTION OF GENES ENCODING 
TITLE OF INVENTION: RECEPTORS AND PROTEINS ASSOCIATED WITH CELL 
PROLIFERATION 

NUMBER OF SEQUENCES: 134 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: INCYTE PHARMACEUTICALS, INC. 
STREET: 3174 PORTER DRIVE 
CITY: PALO ALTO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Word Perfect 6.1 for Windows /MS-DOS 6.2 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/276, 531 
FILING DATE: Herewith 
CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 60/079,677 
FILING DATE: March 27, 1998 
CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 
NAME: Lynn E. Murry, Ph.D. 
REGISTRATION NUMBER: 42,918 
REFERENCE/ DOCKET NUMBER: PA- 00 08 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (650) 855-0555 
TELEFAX: (650) 845-4166 
INFORMATION FOR SEQ ID NO: 42: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 3707 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: CERVNOT01 
CLONE: 936117 
US-09-276-531-42 

Query Match 3.5%; Score 35.6; DB 3; Length 3707; 

Best Local Similarity 51.9%; Pred. No. 1.7; 

Matches 80; Conservative 0; Mismatches 74; Indels 0; Gaps 
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Qy 4 45 TGTATTCGACATGAACCACTTGCCAAAATCATCCTCTTTTCTAATCAATTCAGAGATTTC 504 

II I I I I I I I II I I I I I I I I I I I I I I I I I I MM 

Db 3154 T G CT TT CAAAAT GT G GAACAAACT AAAAT AT AAGGCT T T T C T GAT AAACT AT AAAAAT T T 3095 

Qy 505 TTTAAGTACGTGGAGTTGTC7\ACATTTGATATTGCTTCAGATGCCTTTGCTACTTTCAAG 564 

I II II INI I I I I I I I I II I I I I I I I I I 

Db 3094 AAT CAGCACTTGGAT CTAAT GACATATCTTTATAAT ACTT C CT CT GC AG AT AC ATT C ACT 3035 

Qy 565 GAT T T AC T AAC C AGAC AT AAAG T G T T GGT AG C AG 598 

I I I II I I I I I I I I I I I III 

Db 3034 T AGT T C AAAC C T T AAC AT AC AAAGT T AGT CT C AG 3 001 



RESULT 10 

US-09-620-312D-393/C 

Sequence 393, Application US/09620312D 
Patent No. 6569662 
GENERAL INFORMATION: 
APPLICANT: Tang, Y. Tom 
APPLICANT: Liu, Chenghua 
APPLICANT: Asundi, Vinod 
APPLICANT: Zhang, Jie 
APPLICANT: Ren, Feiyan 
APPLICANT: Chen, Rui-hong 
APPLICANT: Zhao, Qing A. 
APPLICANT: Wehrman, Tom 
APPLICANT: Xue, Aidong J. 
APPLICANT: Yang, Yonghong 
APPLICANT: Wang, Jian-Rui 
APPLICANT: Zhou, Ping 
APPLICANT: Ma, Yunqing 
APPLICANT: Wang, Dunrui 
APPLICANT: Wang, Zhiwei 
APPLICANT: John Tillinghast 
APPLICANT: Drmanac, Radoje T. 

TITLE OF INVENTION: No. 6569662el Nucleic Acids and 
TITLE OF INVENTION: Polypeptides 
FILE REFERENCE: 784CIP2B 

CURRENT APPLICATION NUMBER: US/09/620, 312D 
CURRENT FILING DATE: 2000-07-19 
PRIOR APPLICATION NUMBER: 09/552,317 
PRIOR FILING DATE: 2000-04-25 
PRIOR APPLICATION NUMBER: 09/488,725 
PRIOR FILING DATE: 2000-01-21 
NUMBER OF SEQ ID NOS : 1105 
SOFTWARE: ptFLgenes Version 1.0 
SEQ ID NO 393 
LENGTH: 5714 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE: 
NAME/ KEY: CDS 
LOCATION: (272) . . (4312) 
US-09-620-312D-393 



Query Match 3.5%; 
Best Local Similarity 51.9%; 



Score 35.6; DB 4; 
Pred. No. 2.2; 



Length 5714; 
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Matches 80; Conservative 0; Mismatches 74; Indels 0; Gaps 0; 



Qy 44 5 T GT AT T C G AC AT GAAC C ACT T G C C AAAAT CAT CCTCTTTTC T AAT C AAT T C AGAG AT TT C 504 

|| | | | | | | I II I Mill I I I I II I I I I I I I I I I 

Db 5233 T G C T T T C AAAAT GT G GAAC AAAC T AAAAT AT AAG GCTTTTCT G AT AAACT AT AAAAAT T T 5174 

Qy 505 TTTAAGTACGTGGAGTTGTCAACATTTGATATTGCTTCAGATGCCTTTGCTACTTTCAAG 5 64 

I I I I I I I I I I I I I I I I I II I I I I I I I I 1 

Db 517 3 AAT C AG C AC T T G GAT CT AAT GAC AT AT C T T T G T AAT AC TTCCTCTG C AG AT AC AT T C AC T 5114 

Qy 565 GAT T T ACT AAC C AGAC AT AAAGT GT T G GT AG C AG 598 

III II I I I I I I I I I I I I I I 
Db 5113 T AGT T C AAAC CTT AAC AT AC AAAGTT AGT CT C AG 5080 



RESULT 11 
US-09-004-838-124 

; Sequence 124, Application US/09004838 
; Patent No. 6350933 

GENERAL INFORMATION: 

APPLICANT: Michelmore, Richard W. 
APPLICANT: Shen, Kathy 
APPLICANT: Meyers, Blake 
; TITLE OF INVENTION: Procedures and Materials for 
; TITLE OF INVENTION: Conferring Pest Resistance in Plants 
NUMBER OF SEQUENCES: 140 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Townsend and Townsend and Crew LLP 
; STREET: Two Embarcadero Center, Eighth Floor 

CITY: San Francisco 
; STATE: California 

COUNTRY: USA 

ZIP: 94111-3834 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentln Release #1.0, Version #1.30 

; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/ 004 , 83 8 

FILING DATE: 09-JAN-1998 

CLASSIFICATION: 800 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/781,734 

FILING DATE: 10-JAN-1997 
; ATTORNEY/ AGENT INFORMATION: 

; NAME: Einhorn, Gregory P. 

REGISTRATION NUMBER : 38,44 0 

REFERENCE/ DOCKET NUMBER: 02 3070-07 8 8 10US 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415) 576-0200 

TELEFAX: (415) 576-0300 
INFORMATION FOR SEQ ID NO: 124: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 12793 base pairs 
; TYPE: nucleic acid 

; STRANDEDNESS: single 
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TOPOLOGY: linear 
MOLECULE TYPE: DNA 
FEATURE : 
NAME/ KEY: 
; LOCATION: 1. .12793 

; OTHER INFORMATION: /note- "RG2S " 

US-09-004-838-124 

Query Match 3.5%; Score 35.2; DB 4; Length 12793; 

Best Local Similarity 47.6%; Pred. No. 4.4; 

Matches 101; Conservative 10; Mismatches 98; Indels 3; Gaps 1; 

Qy 438 GAGAGAAT GT ATT C G AC AT GAAC C AC T T G C C AAAAT CAT CCTCTTTTC T AAT C AAT T C AG 4 97 

I I I I : I : : I I I : : : I : : : : I I I I I I I III III I 1 

Db 5998 GAGARAGWAWGRRRGAKAKARMCSMSYTTGGGATGTGATACTTCTTTTAGGAAAATGGAG 6057 

Qy 4 98 AGATTTCTTTAAGTACGTGGAGTTGTCA ACATTTGATATTGCTTCAGATGCCTTTGC 554 

I I M I I I I II I 1 I I I I I I I I I I I I I I Ml 

Db 6058 T TAT AT C T T T GAT AT T GT AT T T T T T T AAT G T AAT T TAT AT AT T T AAT CAT T T T AGT T T AT 6117 

Qy 555 T AC T T T CAAGGAT T T ACT AAC C AGAC AT AAAGT GT T G GT AG CAGAC T T CT T AGAAC7\AAA 614 

I IN I I I I I I I I I I I I I I I I I I III I I I I I 

Db 6118 AAGT T T TAT T TAT T T T GAT AT GAAAAAAAAAGT C T T T TAT AC AT T G GAT T T AAC AT AAAA 6177 

Qy 615 T T AC G AC AC TAT T T T T G AAG AC T AT G AG AAAT 64 6 

I I I I I I I 1 I I I I I I i I I I 
Db 6178 AT C C AAC AAT AT T AAT C AAAAAGAC C AMAC AT 62 09 



RESULT 12 
US-08-961-083-89 

; Sequence 89, Application US/08961083 
; Patent No. 6159469 

GENERAL INFORMATION: 

APPLICANT: Choi et. al . 
; TITLE OF INVENTION: Streptococcus pneumoniae Antigens and Vaccines 
NUMBER OF SEQUENCES: 4 52 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Human Genome Sciences, Inc. 

STREET: 9410 Key West Avenue 

CITY: Rockville 

STATE: Maryland 

COUNTRY: USA 

ZIP: 20850 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette, 3.50 inch, 1.4Mb storage 

COMPUTER: HP Vectra 486/33 

OPERATING SYSTEM: MSDOS version 6.2 

SOFTWARE: ASCII Text 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/ 961 , 083 

FILING DATE: 

CLASSIFICATION: 435 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 
; ATTORNEY/AGENT INFORMATION: 
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NAME: Brookes, A. Anders 
REGISTRATION NUMBER: 36,373 
REFERENCE/ DOCKET NUMBER: PB34 0P2 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (301) 309-8504 
TELEFAX: (301) 309-8512 
INFORMATION FOR SEQ ID NO: 89: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 775 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 
US-08-961-083-89 

Query Match 3.5%; Score 35; DB 3; Length 775; 

Best Local Similarity 46.9%; Pred. No. 1.1; 

Matches 143; Conservative 0; Mismatches 160; Indels 2; Gaps 1; 

Qy 22 AGTAAAT CACACAAAAAT CCAGCAGAAAT T GT GAAAAT C CT GAAAGACAATTT GGCCATT 81 

I I I I I I I I I I I I I I I I I I I I I 1 I I I I I 

Db 2 63 AGT C AAC CAT C AGACAAAC C AGCT GAG GAAT CAAAAGTT GAACAAGCAGGT GAAC C AGT C 322 

Qy 82 T T G GAAAAG C AAG AC AAAAAG AC AGAC AAG G C T T C AG AAGAAGT GT C T AAAT C AC T G C AA 141 

I it I I I I I 1 1 I I I I I i 1 i I I I I II 

Db 323 G C G C C AAG AGAAG AC G AAAAG GC AC C AG T C G AGC C AGAAAAG C AAC C AG AAG C T C C T G AA 382 

Qy 142 GC AAT GAAAGAAATT CT GT GT GGT AC AAAC GAGAAAGAAC C C C C AAC AGAAGCAGT GGCT 201 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3 83 GAAG AG AAGGCT GT AG AG GAAAC AC C GAAAC AAGAAG AGT C AACT C CAGAT AC C AAGG CT 4 42 

Qy 2 02 CAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTGACACTGATAGCTGACCTGCAG 2 61 

I || Mil I I I II I III II I I II I Ml M 

Db 443 GAAGAAACTGTAGAA- - CCAAAAGAGGAGACTGTTAATCAATCTATTGAACAACCAAAAG 500 

Qy 262 CT GATAGACTTTGAGGGAAAAAAAGAT GT GACCCAGAT ATTTAACAACAT CTT GAGAAGA 321 

Mi l II I Mill I II II I II I I I M I I I 

Db 501 TT GAAACGCCT GCT GT AGAAAAACAAAC AGAACCAACAGAGGAAC CAAAAGTT GAAC AAG 560 

Qy 322 CAGAT 326 

I II I 

Db 561 CAGGT 565 



RESULT 13 
US-09-536-784-89 

; Sequence 89, Application US/09536784 

; Patent No. 6573082 

; GENERAL INFORMATION: 

APPLICANT: Choi et . al . 

TITLE OF INVENTION: Streptococcus pneumoniae Antigens and Vaccines 
; NUMBER OF SEQUENCES: 452 

; CORRESPONDENCE ADDRESS: 

ADDRESSEE: Human Genome Sciences, Inc. 

STREET: 9410 Key West Avenue 

CITY: Rockville 

STATE: Maryland 

COUNTRY: USA 
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ZIP : 20850 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette, 3.50 inch, 1.4Mb storage 

COMPUTER: HP Vectra 486/33 

OPERATING SYSTEM: MSDOS version 6.2 

SOFTWARE: ASCII Text 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/536, 7 84 
; FILING DATE: 30-Oct-1997 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/961,083 

FILING DATE: OCT-30-19 97 
ATTORNEY/AGENT INFORMATION: 

NAME: Michelle S. Marks 

REGISTRATION NUMBER: 41,971 

REFERENCE/ DOCKET NUMBER: PB340P3 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (301) 309-8504 

TELEFAX: (301) 309-8512 
INFORMATION FOR SEQ ID NO: 89: 
. SEQUENCE CHARACTERISTICS: 

LENGTH: 775 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 
; SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

US-09-536-784-89 

Query Match 3.5%; Score 35; DB 4; Length 775; 

Best Local Similarity 46.9%; Pred. No. 1.1; 

Matches 143; Conservative 0; Mismatches 160; Indels 2; Gaps 1; 

Qy 22 AGT AAAT C AC AC AAAAAT C C AG C AG AAAT T GT GAAAAT C C T G AAAG AC AAT T T G G C CAT T 81 

I II I I I I I I I I I I I I I I M I I III I I I 

Db 2 63 AGT CAACC AT CAGACAAACC AG CT GAGG AAT CAAAAGT T GAAC AAG C AG GT GAAC C AGT C 322 

Qy 82 T T G GAAAAG C AAGAC AAAAAG AC AG AC AAG G C T T C AG AAGAAGT GT C T AAAT C AC T GC AA 141 

IN MINIMUM I I I I I I I I II I I II 

Db 323 GC GC C AAGAGAAGAC GAAAAG G CAC C AGT C GAGC CAGAAAAGC AAC C AGAAGCT CCT GAA 382 

Qy 14 2 G C AAT GAAAG AAAT TCTGTGTG GT AC AAAC G AG AAAGAAC C C C C AAC AG AAG C AG T G G CT 201 

II I II I I I I II I I I I I I I I I I I I I I I I 

Db 3 83 GAAG AG AAG G C T GT AG AG GAAAC AC C G AAAC AAGAAG AGT C AAC T C C AG AT AC C AAG G C T 4 42 

Qy 202 CAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTGACACTGATAGCTGACCTGCAG 261 

I II I II I III It I III II I I II I Ml M 

Db 443 GAAGAAACT GT AGAA — CC AAAAGAGGAGACT GTT AAT C AAT CTATT GAAC AAC CAAAAG 500 

Qy 2 62 CT GAT AGACT T T GAGG GAAAAAAAG AT GT GAC C C AGAT AT T T AAC AAC AT C T T GAG AAGA 321 

I I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 501 T T GAAAC GC C T GC T GT AGAAAAAC AAACAGAAC C AAC AGAG GAAC CAAAAGT T GAAC AAG 560 

Qy 322 CAGAT 326 

I I I I 

Db 561 CAGGT 565 
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RESULT 14 
US-08-961-083-217 

Sequence 217, Application US/08961083 
Patent No. 6159469 
GENERAL INFORMATION: 

APPLICANT: Choi et. al . 

TITLE OF INVENTION: Streptococcus pneumoniae Antigens and Vaccines 
NUMBER OF SEQUENCES: 4 52 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Human Genome Sciences, Inc. 
STREET: 9410 Key West Avenue 
CITY: Rockville 
STATE: Maryland 
COUNTRY: USA 
ZIP: 20850 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette, 3.50 inch, 1.4Mb storage 
COMPUTER: HP Vectra 486/33 
OPERATING SYSTEM: MSDOS version 6.2 
SOFTWARE: ASCII Text 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/961 , 083 
FILING DATE: 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 
FILING DATE: 
ATTORNEY/AGENT INFORMATION: 
NAME : Brookes, A. Anders 
REGISTRATION NUMBER: 36,373 
REFERENCE/ DOCKET NUMBER: PB34 0P2 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (301) 309-8504 
TELEFAX: (301) 309-8512 
INFORMATION FOR SEQ ID NO: 217: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1696 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 
US-08-961-083-217 

Query Match 3.5%; Score 35; DB 3; Length 1696; 

Best Local Similarity 46.9%; Pred. No. 1.7; 

Matches 143; Conservative 0; Mismatches 160; Indels 2; Gaps 1 

Qy 22 AG T AAAT C AC AC AAAAAT C C AG C AG AAAT T G T G AAAAT C C T G AAAG AC AAT T T G G C CAT T 81 

I I I I I I I I I I I I I I I I ! I I I I IN I I I 

Db 275 AGT CAACC AT CAGACAAACCAGCT GAGGAAT CAAAAGTT GAACAAGCAGGT GAAC CAGT C 334 

Qy 82 T T GGAAAAGC AAGACAAAAAG AC AGAC AAGGC T T C AGAAGAAGT GT CT AAAT C ACT G CAA 141 

III I I I I I I I I I I I 1 I I I I I I I I I I I I II 

Db 335 G C G C C AAG AGAAG AC G AAAAG G C AC C AG T C G AGC C AGAAAAG C AAC C AG AAG C T C C T GAA 394 

Qy 142 GCAAT GAAAGAAATT CT GT GT GGT AC AAAC GAGAAAGAAC C CCCAAC AGAAGC AGT GG C T 201 
I 1 I I I I I I I I I I I I I I I I I I ! I 
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Db 395 GAAGAGAAGGCTGTAGAGGAAACACCGAAACAAGAAGAGTCAACTCCAGATACCAAGGCT 4 54 

Qy 202 C AG C TAG C AC AAG AAC T C T AC AG C AGT G G C C T G C T AGT G AC AC T G AT AG CT GAC C T G C AG 2 61 

I li MM III M I M I M I I M I Ml II 

Db 455 GAAGAAACT GT AGAA — C CAAAAGAG GAGACT GT T AAT C AAT CT AT T GAAC AAC CAAAAG 512 

Qy 262 CT G AT AGAC TT T GAG G GAAAAAAAG AT GT GAC C C AGAT AT T T AAC AAC AT C T T G AGAAGA 321 

I M I II I Mill I II I II I II I I I I I I I 

Db 513 T T GAAAC G C C T GCT GT AG AAAAAC AAACAG AAC C AAC AG AG GAAC C AAAAGT T GAAC AAG 572 

Qy 322 CAGAT 326 

Ml I 

Db 573 CAGGT 577 



RESULT 15 
US-09-536-784-217 

; Sequence 217, Application US/09536784 
; Patent No. 6573082 

GENERAL INFORMATION: 
; APPLICANT: Choi et. al . 

TITLE OF INVENTION: Streptococcus pneumoniae Antigens and Vaccines 
NUMBER OF SEQUENCES: 452 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Human Genome Sciences, Inc. 
STREET: 9410 Key West Avenue 
CITY: Rockville 
; STATE: Maryland 

COUNTRY: USA 
; ZIP: 20850 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette, 3.50 inch, 1.4Mb storage 
COMPUTER: HP Vectra 486/33 
OPERATING SYSTEM: MSDOS version 6.2 
SOFTWARE: ASCII Text 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/536, 784 
FILING DATE: 30-Oct-1997 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/961,083 
; FILING DATE: OCT-30-1997 

ATTORNEY/AGENT INFORMATION: 
NAME: Michelle S. Marks 
REGISTRATION NUMBER: 41,971 
REFERENCE/ DOCKET NUMBER: PB34 0P3 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (301) 309-8504 
TELEFAX: (301) 309-8512 
; INFORMATION FOR SEQ ID NO: 217: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1696 base pairs 
TYPE: nucleic acid 
; STRANDEDNESS: double 

TOPOLOGY: linear 
SEQUENCE DESCRIPTION: SEQ ID NO: 217: 
US-09-536-784-217 
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Query Match 3.5%; Score 35; DB 4; Length 1696; 

Best Local Similarity 46.9%; Pred. No. 1.7; 

Matches 143; Conservative 0; Mismatches 160; Indels 2; Gaps 1; 

Qy 22 AGT AAAT CACACAAAAAT CCAGCAGAAATT GT GAAAAT C CT GAAAGACAATTT GGC CATT 81 

I I I I I I I I I I I M I I I I I I I I I > I I I I 

Db 275 AG T C AAC CAT C AGAC AAAC C AG C T GAG G AAT C AAAAGT T GAAC AAGC AG GT G AAC C AGT C 334 

Qy 82 T T GGAAAAGC AAGACAAAAAGACAGACAAG GCT T CAGAAGAAGT GT C T AAAT C ACT GC AA 141 

I II I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 335 GCGC C AAGAGAAGACGAAAAGGCACCAGTC GAGCCAGAAAAGC AAC CAGAAGCT CCT GAA 394 

Qy 142 GCAAT GAAAGAAATT CT GT GT GGTACAAACGAGAAAGAAC C CC C AACAGAAGCAGT GGCT 2 01 

I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 395 GAAGAGAAGGCT GT AGAGGAAACACCGAAACAAGAAGAGT CAACT CCAGAT ACCAAGGCT 4 54 

Qy 202 CAGCTAGCACAAGAACT CTACAGCAGT GGC CT GCT AGT GACACT GAT AGCT GAC CT GC AG 2 61 

I II I I I I III II I I I I I I I I I 1 I Ml M 

Db 455 GAAGAAACT GT AGAA — C C AAAAG AG GAGACT GT T AAT CAAT C TAT T GAAC AAC C AAAAG 512 

Qy 2 62 C T GAT AG AC T T T GAG G GAAAAAAAG AT GT GAC C C AG AT AT T T AAC AAC AT C T T G AGAAG A 321 

I III I I I I I I I I I I I I I I IN I I MM I 

Db 513 TT GAAAC GC CT GCT GT AGAAAAACAAAC AGAAC C AACAGAGGAACC AAAAGTT GAACAAG 572 

Qy 322 CAGAT 32 6 

II I I 

Db 573 CAGGT 577 



Search completed: January 6, 2004, 03:19:48 
Job time : 96 sees 
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GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



CM nucleic- 
Run on: 



- nucleic search, using sw model 
January 6, 2004, 02:35:04 



; Search time 13 94 Seconds 

(without alignments) 

2517.743 Million cell updates/sec 



Title : 

Perfect score: 
Sequence : 

Scoring cable : 



US-10-088-872-1 
1014 

1 atgaaaaaaatgcctttgtt . 



. tgaagaaaacggccccttga 



1014 



I DENT I T Y_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 2263443 seqs, 1730637950 residues 

Total- number of hits satisfying chosen parameters: 

M i n i -amiTi . DE seq length: 0 

Maximum 03 seq length: 2000000000 



4526836 



Minimum Match 0% 
Maximum Match 100% 
Listinq first 45 summaries 



Database 



Publ i shed_App 1 i c at i ons_ 



NA: 



i 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 



/cgn2_6/ptcdata/l/pubpna/US07_PUBCOMB.seq:^ 
Vcgn2j6/ptodata/:i/pubpna/PCT; NEW _PUB .seq: * 
'/cgrj2__6/ptodata/l/pubpna/US06__NEW_PTJB.seq:* 
/cgn2_6/ptodata/ l/pubpna/US06_PUBCOMB . seq : * 
/cgn2_6 /ptodata/ l/pubpna/US07_NEW_PUB .seq:* 
/cgn2_6/ptodata/l/pubpna/PCTUS_PUBC0MB.seq:* 
/cgn2_6/ptodata/l/pubpna/US08_NEW_PUB.seq:* 
/cgn2_6/ptodata/l/pubpna/US08_PUBCOMB.seq:* 
/ C gn2_6/ptodata/l/pubpna/US09A_PuBC0MB.seq: * 
/cgn2_6/ptodata/l/pubpna/US09B_PUBCOMB.seq:^ 
/cgn2_6 /ptodata/ l/pubpna/US0 9CJPUBCOMB . seq : ' 
/cgn2_6 /ptodata/ 1 /pubpna/USO 9_NEW_PUB . seq: * 
/cgn2_6 /ptodata/ 1/pubpna/US 0 9_NEW_PUB . seq2 : • 
/cgn2 6 /ptodata/l/pubpna/US10A_PUBCOMB . seq : • 
/cgn2^6 /ptodata/l/pubpna/US10B__PUBCOMB . seq: •■ 
/cgn2 6/ptodata/l/pubpna/US10_NEW_PUB.seq:* 
/cgn2V6 /ptodata/ l/pubpna/US6 0_NEW_PUB . seq : * 
/ cgrajS /ptodata/ 1 /pubpna /US 6 O^PUBCOMB . seq : * 



r> r -<* ho is the number of results predicted by chance to have a 
•score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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22 


41 . 6 


4 


1 


z 4 z 


Q 


TTC _ A Q - Q 9 - R7£-9R9R 


Sequence 2 528, Ap, 




2 3 


41.6 


4 


1 


Z 4 Z 


i o 
iz 


TTC-0Q-Q93-R7K-9S9-3 


Sequence 2 528, Ap 




2 4 


-i o . a 


4 


0 


*i n o c 
1 2. z b 


12 


US-10-310-154-294 


Sequence 2.94, App 




25 


4 0 . 2 


4 


. 0 


113 3 06 


12 


US-10-292-798-1007 


Sequence 1007, Ap. 




..'6 


3 9 . 8 


3 


. 9 


/I Q T 


11 


. US -09-918-995-5787 


Sequence 5:787, Ap- 


c 




... ,: . 4 


3 


. 8 


6301 


13 


US-10-311-455 -26 


Sequence 26, Appli: 




- l -> 


2 


3 


. 8 


1 O A A 

Iz U U 


13 


US -10 -027 - 632 -26 12 3 5 , 


Sequence 25.1235,1 




'/ V 


•• * 


3 


.3 


12 0 0 


14 


US -10 -02 7 -63 2 -261235 


Sequence 2 612 3 5,"*:- 




3 0 


3 V . 8 


3 


. 7 


14 57 


15 


US-10-054-968-9 


Seauence 9- Appli* 




■'J 1 


37 .8 


3 


. 7 


717 8 


13 


US-09-873-367C-278 


Seauence 278, App 




3 2 


3 7 .6 


3 


'7 


12 6 7 


14 


US-10-001-843-45 


Spmience 45. AppI 


c 


33 


37.6 


3 


. 7 


3b73778 


13 OS- .10-312-841-1 


Seauence 1, Appl" 




34 


3.7 . 4 


3 


. 7 


2232 


15 


US-10-087-464-45 


C o rri lPTlCe 4 5 APD-1 


rj 


35 


37.4 


3 


. 7 


4012 


9 


US-09-876-889-335 


Sequence 335, App 


C 


?,e 


37 . 4 


3 


. 7 


4103 


13 


US-10-117-722-390 


Sequence 390, App 


v.~i 


3 7 


37.4 


3 


. 7 


4 1 J J 


15 


US-10-037-270-390 


Sequence 3 90, App 




3 8 


37.4 


3 


.7 


8577 


13 


US-10-311-455-1760 


Sequence 1760, Ap 




3 9 


3.7 , 2 


3 


7 


869 


13 


US -10 -027 -632 -26 1978 


Qpmipnrp 9 61 97R 


C 


4 0 


3 7.2 


3 


.7 


869 


14 


US -10-027-6 32 -26197 8 


Sequence 261978,; 


. c 


4 1 


37 


3 


.6 


5413 


13 


US-10-311-455-538 


Sequence 538, App 


c 


•42 


3 6.6 


3 


.6 


9367 


13 


US-10-311-455-944 


Sequence 944, App 




3 


36.4 


3 


. 6 


461 


14 


' US-10-079-623-143 


Sequence 14 3, App 




'c4 


36.4 


3 


.6 


2641 


12 


US -10 -3 69 -493-292 99 


Sequence 292 99, A 


c 


4 £ 


36.4 


3 


.6 


6071 


13 


US-10-311-455-297 


Sequence 297, App 
















ALIGNMENTS 





RESULT 1 

US-10-117-722-111 

; Sequence- 111, Application US/10117722 

; Publication. No. US20030219744A1 

; GENERAL INFORMATION : 

; APPLICANT: Tang, Y. Tom 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Liu, Chenghua 
Asundi, Vinod 
Zhang, Jie 
Drmanac , Rado j e T . 
TITLE OF INVENTION: No. US2 003 02 19744Alel Nucleic Acids and 
TITLE OF INVENTION: Polypeptides 
FILE REFERENCE: 784CIP2BCIP 

CURRENT APPLICATION NUMBER: US/ 10/ 117 , 722 
CURRENT FILING DATE: 2002-04-04 
PRIOR APPLICATION NUMBER: 09/620,312 
PRIOR FILING DATE: 2000-07-19 
PRIOR APPLICATION NUMBER: 09/552,317 
PRIOR FILING DATE: 2000-04-25 
PRIOR APPLICATION NUMBER: 09/488,725 
PRIOR FILING DATE: 2000-01-21 
NUMBER OF SEQ ID NOS : 1104 
SOFTWARE: pt_FL_genes Version 1.0 
SEQ ID NO 111 

LENGTH: 14 21 

TYPE : DNA 

ORGANISM- Homo sapiens 
FEATURE : 
. NAME /KEY: CDS 
LOCATION: (217) . . (1230) 
~>-:\0-117 -722-111 '■ ; " 

every Match 100.0%; Score 1014; DB 13; Length 1421; ■' ' 

iSst Local Similarity 100.0%; Pred. No. 1.2e-281; ■■ - 

KatciV.s .1014; Conservative 0; Mismatches 0;. Indels 0; Gaps.:. 0; 

i ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATC 60 

* MMMMIMMMMMMIMMMIMMIMIMMMMIMIMMMMIM 

ob 211 ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATC 276 

, y , 6 v ^TGAAAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 12 0 

iiMIIIIMIMMIIillMIMIIIIIIilllllllllMMIMIIMIMIIIII 

Db 27 7 CTGAAAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 3 36 

t 2 t GAAGTGTCTAAATCA CTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA 180 

MIIMMMMMMMMMMMMIMMMMMMMIMMMMM 

Db 3 37 GAAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA 



396- 



nv 131 CCCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 24 0 

•': " MMMIMMMIMIMMIIIMMMMMMMMMMMIMMMMMMI 

■ 3b 3 9 7 c C C C C AAC AG AAG C A.G TGGC T C AG C TAG CA C AAGAACTCTACAG C AGTGGC CTG C T AGTG 4 56 



QV 2 41 



300. 



ACACTGATAG^TGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATA 

M 1 1 1 1 1 1 1 ! Til 1 1 1 M I M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 i i 1 1 1 1 1 1 1 1 1 1 1 1 1 S 

Db 45 7 ACACTGATAGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATA 516 



Qw TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGT 3 60 

I Mi Ml II ill! I IIIIMilllillll MIIMIMIIMII IN MIIMIM MM ^ 

Db 517 TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGT 576 

Gv -j^i GCTCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 42 0. 

" MIIMM Mill Mill Mill I MUM II II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 MM I III 

Db 577 GCTCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 636 



491 pptTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTC 4 80 
Qy 421 C jTTGTGGGATTATGCTG , , , , , , , , , , , , , , , , , , , , , , , , | 

Db 637 CGTTGTGGGATTATGCTC 696 

, 01 TTTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCT 540 

: :;: rrr™ - 

757 TCAGATGCCTTTGCTACTTTCA^ 816 
Q , 601 TTCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTC 660 

Db 317 UctIaGAACA^ 876 

, f1 a ATTATrTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGAGCGTCAC 720- 

07 M 1 1 Ml 1 1 1 MM II I i I M M M I M M M M M M I M M IJ M M 1 1 1 M M ! M I 

Db 877 AATTATGTTACTAAGAGACAGTCTTT 93b 

7-1 AACTTTGCCATCATGACAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAAC 780 

cy 7.i ^[^^ |1||MM11M , | , M ||||,|||||MMM[MMIMMIMI q 

IJiCTTTGCCATCATGACAAAGTATA 996 
.- IGCTTCGGGATAAAAGTCCGAACATCCAGTTTGAAGCCTTTCArGTTTTTAAGGTGTTT 840, 

; mill ' i 1 1 I II 1 1 III 1 1 1 1 M I II I II 1 1 II I Ml I i Ml M II II 1 1 II M II II 
ctccttcgggatIaaagtccca^ 105$ 

r ,r rrr"( TA.GTCCTCACAAAACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAA 90:0, 

i mi imTTTitituTTi 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 1 m 1 1 1 1 1 

GTGGCCAGTCCTCAC 1116 
301 C : TCATTGAGTTTCTGAGCAGCTTCCAAAAAGAAA(MACGGATGATGAGCAG 960 

i 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M II I M I II 1 1 1 1 M I II I M II II II 1 1 M 

117 CTCATTGAGTTTC 1176 

0,-1 C ag AAGAACT ACTTG ATTAAAC AG ATCCGAGACTTGAAG AAAACGGC CC CTTGA 1014 
Qy „.l GAGAAGAAGTAC G | | I I 1 I I I 1 MIIMMIMM 

1177 gAgUgAACTAC^GATTAAACAGATCCGAGACTTGAAGAAAACGGCCCCTTGA 1230 



'£f'b 



Db 



US -10-037-270-111 

; £ eqU ence 111, Application tfS/10037270 

/publication No. US2003010452 9A1 

; GENERAL INFORMATION: 

; APPLICANT: Tang, Y. Tom 

; APPLICANT: Liu, Chenghua 

APPLICANT: Asundi , Vinod 
; APPLICANT: Zhang, Jie 

APPLICANT: Ren, Feiyan 
; APPLICANT: Chen, Rui-hong 

APPLICANT: Zhao, Qing A. 

APPLICANT: Wehrman, Tom 

APPLICANT: Xue , Aidong J - 
; APPLICANT: Yang, Yonghong 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Wang, Jian-Rui 
Zhou, Ping 
Ma, Yunqing 
Wang, Dunrui 
Wang, Zhiwei 
Tillinghast, John 
Drmanac , Rado j e T . 
TITLE OF INVENTION: No. US2003010'4529Alel Nucleic Acids and 
♦ TITLE OF INVENTION: Polypeptides 
; FILE REFERENCE: 784CIP2B 

; CURRENT APPLICATION NUMBER: US/10/037,270 

: CURRENT FILING DATE: 2002-01-04 

; PRIOR APPLICATION NUMBER: 09/552,317 

; PRIOR FILING DATE: 2000-04-25 

,■ PRIOR APPLICATION NUMBER: 09/483,725 

; PRIOR FILING DATE: 2000-01-21 

; NUMBER OF SEQ ID NOS.: 1104 

; SOFTWARE: pt_FL_genes Version 1.0 

; SEQ ID NO 111 

LENGTH: 1421 " 

TYPE: DNA 

ORGANISM: Homo sapiens 

FEATURE.: 

vjAME/KEY : CDS 

LOCATION: (217) (123 0) \ \" f ; 

;is-io--037-;:70-iii ■ • • 

WjLOi.y Match 100.0%; Score 1014; DB 15; Length 1421; ' 

Local Similarity 100.0%; Pred. No. 1.2e-281; 
v;>7.::hes 1014; Conservative 0; Mismatches 0; Indels 0; .Gaps 0.; 

. i ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATC 60 

' j M l M 1 1 M 1 1 1 1 1 1 M 1 1 I M I i ! M 1 1 1 M I M 1 1 ! 1 M I M M HUM!! 

• J; . - ;• ; 7 ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCc^AGCAGAAATTGTGAAAATC 2 /B 

-TGAAAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 12 0 

IIIIMIIMIIIIIIilllMlillMMIIIIMIIIMMIilMIMIIIIIIMI 

CTGAAAGACAATTTGGCCATTTTGGA AAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 33 6 
GAAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA 18 0 

1 1 M M 1 1 1 1 1 1 1 1 E 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GAAGTGTCTAAATCACTGCAAGCAATG AAAGAAATTCTGTGTGGTACAA ACGAGAAAG AA 3 96: 
CCCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 2 4 & . 

i M i M 1 1 ! M M 1 1 1 1 1 1 M 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 i i 1 1 1 1 1 i 1 1 1 1 1 1 1 i 1 1 1 1 

CCCCCAACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 4 56 
ACACTGATAGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGATA 3 0 0- 

IIIIIIIMMMIIIIIIIIMIMIIIIIilllllllillMMMIMMIMIMI 



QV- 


ol 


Ch 


? 7 7 


QY 

Db 


121 


W ' 




"Ob 




QY 


24-1 


pb 




Qy 


. 301 


Db 


517 


Qy 


3 61 



1 1 1 1 ! 1 1 M I M 1 1 1 II 1 1 M ! M 1 1 1 II 1 1 1 ! 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 M 

TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGT 
GCTCATCrTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 

IIIIMIIIIMMIMIIIIIIIIMIIIIIIMIMIIIMIIIMIIIIIIIIMM 



420 



Db 577 GCTCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 636 

a o cGTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTC 4 80 

MMMIIIirLIIIIIIMIIIIIIIIMIIIIIIIIIIIIIMMIIIIIIINIIM fiqfi 

Db 637 cgUgTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTC 6 96 

4-7 TTTTC TAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCT 54 0 

mMMIMIIIIMMIIIillllllllllMIIIIMIIIIIIIIMIIIIMIII 

Db 697 TTTTClJJvTCAATT 756 

r^i TrAraTGCCTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGAC 600 

Y IIMM IN MM MM IIIIIIIIIIIMIIMIIMMIIIIIMMIIIIIIMMI 

Db -757 TCAGATGCCTTTGCTACTTTC 816 

f-v 50 1 TTCTTAG^ACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAG 660 

a 17 ttcttagJ^caaaattacgac 876 



Db 

Qy 

Db 

Qy 

Ob 



6 * 1 aatTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCAC 72 0 

mlMllllllMMMMMIMMMMMMMMMMMMMIIMiMMM 

077 jJvTTATGTTTACT^ 936 
— aACTTTGCCATCATGACAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAAC 78 0 

mm mi 1 1| 1 1| 1 1| Ml Mill I Mill MUM 1 1 Mill II I Mill II MUM 

7,7 AACTTTGCCATCATG^ 9 ^ ' 

ny v 7. '■ CTCCTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGCCTT^ 84 0, : 

* m m m I M 1 1 1 1 1 1 1| I M I Ml M M I i I M M I i 1 1 M I II II II ! 1 1 M I II I II 

,,, / CTCCTTCGGGATAAAAOT 10b6 " 
o 41 GT GGGCAGTCCTCACAAAACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAA 900 

X057 GTGGCCAGTCCTCACAAAA 1115 
Oil CTCATTGAGTTTCTGAGCAGCTTCCAAAAAGAAAGGACGGATGATGAGCAGTTCGCTGAC 960. 

n ,01 ^T^TT^^TTT 1 1 m 1 1 ii i ii i ii 1 1 ii 1 1 1 1 ii ii ii ii ii m 1 1 1 M ii 1 1 1 1 1 1 1 

Db 1117 CTCATTGAGTTTCTGAGCAG^ 1176 

Gv- 961 GAGAAGAACTACTTGATTAAACAGATCCGAGACTTGAAGAAAACGGCCCCTTGA 1014 

I I | I I I | I | I | II | | | M II i II M M I I II II M I I I M I I I I I M II I I I I I 
n b 1177 GAGAAGAACTACTTGATTAAACAGATCCGAGACTTGAAGAAAACGGCC^ 123 0 



RiZSULT .3 
US-10-025-730-2 

/sequence 2, Application US/10025730 
• Publication No. US20030045466A1 
'• GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Tang , Y . Tom 
Guegler, Karl J. 
Corley, Neil C. 
„ r[JJi „„, 1 , Gorgone, Gina A. 

TITLE OF INVENTION: CALCIUM BINDING PROTEIN 
FILE REFERENCE: PF-0635 US 

CURRENT APPLICATION NUMBER: US/10/025 , 730 

CURRENT FILING DATE: 2001-12-18 

PRIOR APPLICATION NUMBER: US/09/190,965 



Db 

Qy 



Qy 
Qy 

0b 

Qy 

Db 



Do 



PRIOR FILING DATE: 1998-11-13 
NUMBER OF SEQ ID NOS t 5 
SOFTWARE: PERL Program 
SEQ ID NO 2 
LENGTH: 1344 
TYPE : DNA 

ORGANISM: Homo sapiens 
FEATURE : - 

OTHER INFORMATION: 3734 805 
US-10-025-730-2 

Query Match 99.7%; Score 1010.8; DB 15; Length 1344; 

Best" Local Similarity 99.8%; Pred. No. le-280; 

Matches 1012; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

, ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATC 6 0 

* lllilMilMIIIIIIIIIMIIIIMIillllllSIIIIMMIIM MM 

Db 124 ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATC 183 

, 6 , CTGAAAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 120. 

Q: ' '* || 1 1 1 1 1 1 M M M M MIMMIIMIMMIMMMIMMMM 

184 CTGAAAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGAGAGACAAGGCTTCAGAA 243. 
■ i GAAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA 180 

""""" i ! 1 1 1 1 1 1 1 1 1 1 II M II II II II II I II I M II 1 1 1 1 II 1 1 II II II M II 11 II II 1 1 

y:> 2 44 UAA.7TGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA .s0.3 

, in CC p CrAArAG AAGCAGTGGCTCAGCTAGCACAAGAAC.TCTACAGCAGTGGCCTGCTAGTG 2 10 

•'' ■ ■, nil : 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 M. M M 1 1 1 rM 1 1 M 1 1 M 1 1 1 1 i 1 1 Mi 

Db ;.; 04 CCCCCGACAGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTGGTG 363 

•; • , ACA ^TGATAGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGAC,GCAGATA 3 0.0 4 . 

" " "| 1 1 ill i || II II II M i M II M M Mi M M M I I! I II M M M M M. II M M II M 

,. b acaCTGATAGCTGACCTGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACGCAGATA 423. 1 

-n 1 tTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGT 360 

TTTTTT i M mi 1 1 ii ii mi 1 1 1 1 ii ii i ii 1 1 ii ii ii ii ii i ii 1 1 ii ii ii ii i m 

424 TTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGT 4 83 
-*1 PCTCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 420 

?mMMMMMMIMMM!IMMMIMMMIMIMMIMIIMMMIM ^ 

. Aft4 GCTCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTA 543 : 

» v i CGTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTC 4 80" 

| | | | | | | | | | I | | I I I I I i I I I I I I I M I I I M I I I M I M I IIMMIMM 

jj ; <H CGTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTC 603 

Ov a 8-1 TTTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCT 540 

QY 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 M 1 1 1 M i I II 1 1 1 1 M 1 1 

U04 TTTTCTAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCT 66-3 



■ ^ • -, fpAGATGCCTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGAC 600- 

y ; " MIIIMMIMMIMIMMMIMIIIIMMUMMMMMMMIIMMIM 

Db 6 6 4, TCAGATGCCTTTGCTACTTTCAAGGATTTACTAACCAGACATAAAGTGTTGGTAGCAGAC 72 3 

nt, «M TTCTTAGAACAAAATTACGACACTA.TTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAG 660 

Qy ■ nmi MiMiiii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 m 1 1 1 1 1 i i i 1 1 1 1 1 



Db 


724 


TTCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAG 


783 


Qy 


661 


AATTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCAC 

1 1 1 1 M M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 II M 1 1 1 1 1 1 1 1 M 1 

AATTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCAC 


720 


Db 


7 34 


843 


Qy 


721 


AACTTTGCCATCATGACAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAAC 

■ ■til i i i i i i i I I I I I i * I 1 1 1 t 1 1 1 1 1 i 1 1 1 L 

| M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 

AACTTTGCCATCATGACAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAAC 


780 


Db 


344 


903 jz 


Qy 


781 


CTCCTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTT 


840 


Db 


904 


! !| |! 1 1 1 1 1 ' 1 1 1 1 II 1 1 ; 1 1 i 1 1 i M M 1 i 1 Mi 1 1 1 1 1 1 1 1 M l M 1 i 1 M ! M 1 M 

CTCCTTCGGGATAAAAGTCCC AACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTT 


963 


Qy 


»41 


GTGGCCAGTCCTCACAAAACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAA 


900 




964 


iMiiiiiii-iiiiiiiiiiiiiiM'iiiiiiiiiiiriiiiiiiri.iiiiiiiiiiiii 

GTGGCCAGTCCTCACAAAACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAA 


1023 


Qy 


■901 


CTCATTG AGTTTCTGAGCAGCTTC GAAAAAGAAAGGACGGATGATGAGCAGTTCGCTGAC 

M 1 1 1 ) 1 1 It 1 MIIIIIIMNItlllMM-lllllllillllMIIIIMIIIIiril 

CTCATTGAGTTTCTGAGCAGCTTCCAAAAAGAAAGGACGGATGATGAGCAGTTCGCTGAC 


960 


Db 


• 1024 


10B3 


Qy 


901 


GAGAAGAACTACTTGATTAAACAGATCCGAGACTTGAAGAAAACGGCCCCTTGA 1014 

1 1 1 ! 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 II 1 M 1 1 1 1 i 1 1 II 1 1 M 1 1 1 1 1 1 1 1 M 1 M II 

CAGAAGAACTACTTGATTAAACAGATCCGAGACTTGAAGAAAACGGCCCCTTGA 113 7 






1 '..»■! 4 





. /INSULT < 
OS - 09-^1 v '-95.-.'- 534 3 

,: Sequerj \ih-. :i>34 3 , Application us/09913995 ;■ ■ •■■ . .. 

' , ?:;iblicau-on; 'No. US20030073623 Al *" ,; ,: " 

"; S'NSRAL INFORMATION: 

, APPLICANT: Hyseq, Inc. , 

TITLE OF INVENTION: NOVEL NUCLEIC ACID SEQUENCES OBTAINED 
; TITLE OF INVENTION: FROM VARIOUS cDNA LIBRARIES 
; FILE REFERENCE: 20411-756 

; CURRENT APPLICATION NUMBER: US/0.9/918,995 

; CURRENT F I L ING DATE : 2001-07-30 

; PRIOR APPLICATION NUMBER: US/ 09/235 , 076 

; PRIOR FILING DATE: 1999-01-20 

; NUMBER OF SEQ ID NOS : 38054 

: SOFTWARE: FastSEQ for Windows Version 3.0 

; SEQ ID NO 5 34 3 

: LENGTH : 4 75 ' " 

; TYPE: DNA 

.; ORGANISM: Homo sapiens 
FEATURE: 

NAME /KEY : misc_feature 
LOCATION: (1) . . . (475) 
OTHER INFORMATION: n = A,T,C or G 
US-09-918-395-5343 

Query Match 39.3%; Score 3 98; DB 11; Length 475; 

Best Local Similarity 100.0%; Pred. No. 3.5e-104; 

Matches 398; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 



617 ACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAGAATTATGTTACTAAGA ^676 



Db 


1 


|j||||||||||||||MIIMMMMMMMMMIMMMIIIIIIMIIillM 

ACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAGAATTATGTTACTAAGA 


60 


Ov 


677 


GACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCACAACTTTGCCATCATGA 

II II Mill |l MM Ml II III II II Ml II Ml MM II III II N II INI III Ml 

GACAGTCTTTAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCACAACTTTGCCATCATGA 


736 


Db 


61 


120 


Qy 


737 


CAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAACCTCCTTCGGGATAAAA 

1 M 1 1 1 1 1 II M M 1 1 M 1 1 1 M II 1 1 1 1 1 M II II II 1 1 1 1 II 1 1 1 1 1 1 1 1 M 1 M 1 II 

CAAAGTATATCAGCAAGCCGGAGAACCTGAAACTCATGATGAACCTCCTTCGGGATAAAA 


796 


Db 


121 


180 


Cy 


71)7 


GTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTTGTGGCCAGTGCTCACA 

j I II 1 i II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 II 1 1 1 II 1 1 II II 1 1 1 II II M 1 1 1 1 II Ml 1 

GTCCCAACATCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTTGTGGCCAGTCCTCACA 


856" 


Db 


181 


240 


Qy 




AAACACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAACTCATTGAGTTTCTGA 

M || MM II IIMM MM Mill MUM II III MM Ill III II Ml M 

AAArACAGCCTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAACTCATTGAGTTTCTGA 


916 


Db 


z 4 1 


300 


Qy 


917 


CCAGCTTCCAAAAAGAAAGGACGGATGATGAGGAGTTCGCTGACGAGAAGAACTACTXGA 

11 ill Mil Mill MM MMM MINI HI I'll MUM 1 1 i i 1 II 1 1 1 1 i i 1 1 1 1 1 

GCAGCTTCCAAAAAGAAAGGACGGATGATGAGCAGTTCGCTGACGAGAAGAACTACTTGA 


976 


Db 


301 


360 


Qy 


977 


TTAAACAGATCCGAGACTTGAAGAAAACGGCCCCTTGA 1014 

Mil MMM MMM III III II MMM MM MM 

TTAAACAG ATC CG AG ACTTGAAGAAAACGGC CCCTTGA 3 9 8 






3 61 





:!II^ULT 5 ; , , 

■'■^uence :U8, Application US/099.10943 " : 

} ^/Hent No . US20020081610A1 ' * : / 

i: ": N'HRAL .INFORMATION : .'' : 

PL I CANT : Hemmati -Brivanlou Ali 
: ^ PL I CANT : Alcman, Curtis . 

; riTLE OF INVENTION: Assays and Materials for Embryonic Gene Expression 

• FILE REFERENCE: 7 52 9 / 1G14 8US1 

CURRENT APPLICATION NUMBER :. US/09/910 , 943 

• CURRENT FILING DATE: 2001-07-23 
: iiUMBER OF SEQ ID NOS : 742 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 318 
LENGTH: 690 
; TYPE : DNA 

■ ORGANISM : Xenopus laevis 
; FEATURE : 

NAME /KEY : misc_feature 

LOCATION: (1) . . (690) 

OTHER INFORMATION: n may be a or g or c or t/u 
US-09-910-943-318 

Query Match . 28.5%; Score 288.8; DB 9; Length 690; 

Best Local Similarity 80.5%; Pred. No. 1.4e-72; 
- Matches 338; Conservative 0; Mismatches 82; Indels 0; Gaps 

nv 595 GCAGACTTCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGAAATTGCTTCAG 6 54 

Y ■ Mill II M M 1 1 1 1 1 1 M 1 1 1 1 1 II Ml I MMM II II Mill! 



Db 69 GCAGAATTTCTAGAGCAAAATTACGACAGAATATTTAATGACTATGAAAAGCTTCTTCAC 128 

0y 5,5 TCTGAGAATTATGTTACTAAGAGACAGTCTTTAAAGCTC 714 

0v i i mm I I II MM! MM II I II I I M 11 



Db 



Db 



I I I I I I I I I I I I I Ml II I M M M II ' 

129 TCTCAGAAcIaTG^ 188 

Qy 715 CGTCACAACTTTGCCATCATGACAAAGTATATCAGCAAGCCGGAGAACCT^ 774 . 

y ■■■II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II MM II M I II 1 1 MUM 

189 CGACACAACTTTTCCATTA^ 248 
775 ATGAACCTCCTTCGGGATAAAAGTCCCAACATCCAGTTTGAAGCCTTTCAT 834 

Y 1 1 ill II II II Mill II II I Mill II Mill MUM 

Db 249 j^xcJJ^TCTGCTGCCT 308 

Qy 835 GTGTTTGTGGCCAGTCCTCACAAAACACAGCCTATTGTGGA^ B94 

MilllM II I Ml I I M 1 1 II M II M II M M I I 

Db 309 GTGTTTGTAGCAA^^ 368 

Oy .95 CCCAAACTCATTGAGTTTCTGAGCAGCTTCCAAAAAGAAAGGACGGATGATGAGCAGTTC 95:4 

' 1 1 1 1 I I II I II I M II Ml M I I 1 1 1 1 II I 1 1 M i II! 

Db 369 ACCAAGTTAATCGACTTCCTGAGCAGCT 428 

O- ^ GCTGACGAGAAGAACTACTTGATTAAACAGATCCGAGACTTGAAGAAAACGGCCCCTTGA. 1014. 

y * i ii M 1 1 1 1 1 1 1 II i M II II 1 1 II I II Ml M I I I M III 

,■■■■■< < r, t 

result ; . t . ' c. 

0S-O9-8»:i7 -701- j263 

; dequeue - 5263, Application US/09867701 ^ 

, .^aUent tio. US20020132237A1 \ 

• r'^J^RAL INFORMATION: ' 

. ifepLICANT : Aglate, Paul A. "'^ 

; SWlj'XCANT: Jones, Robert V 

- ■ 10 LI CANT : Har locker, Susan L. 

■ ^ TLB OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY • 

■ TITLE OF INVENTION: AND DIAGNOSIS OF OVARIAN CANCER 
; FILE REFERENCE: 210121-497 ■ 

■ CURRENT APPLICATION NUMBER: US/09/S67 , 701 

; CURRENT FILING DATE: 2001-05-29 ' , 

'; NUMBER OF SEQ ID NOS : 10912 
; " : SOFTWARE : FastSEQ for Windows Version 4.0 
;. SEQ ID NO 5263 
: LENGTH: 43 5 
TYPE: DNA 

ORGANISM: Homo sapien 
US-09- 867 -701 -5263 

Query Match 24.3%; Score 246.4; DB 10; Length 435; 

B*a* Local Similarity 77.6%; Pred. No. 1.8e-60; 

^ 293; Conservative 0; Mismatches 86; Indels 0; Gaps.. 0, 



,25 TTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAGAATTATGTTACTAA 685 

M || M INN Ml Mill M II I II I II I Ml M 1 1 II Ml I 

D ; 4 - TTTTCAGTGAATATGA^ 



Qy 6 86 



TAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCACAACTTTGCCATCATGACAAAGTAT7' 



100 
\ 74 5 



i Mill || II II II I II II I IMIMM I II IIMIMI II I 

Db i.oi TGAAGCTTCTCGGTGAACTACTACTAGATAGACACAACTTCACAATTATGACAAAATACA 160 



Qy 



74 6 TCAGCAAGCCGGAGAACCTGAAACTCATGATGAACCTCCTTCGGGATAAAAGTCCCAACA 8 05 



,111 II II IMIMM III I III IIIMIM II M II MMIII Ml 

Db 161 TCAGTAAACCTGAGAACCTCAAATTAATGATGAACCTGCTGCGAGACAAAAGTCGCAACA 22 0 

Qv 30 £ tccaGTTTGAAGCCTTTCATGTTTTTAAGGTGTTTGTGGCCAGTCCTCACAAAACACAGC 8 65 

1 1 1 1 M 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 MM Ill II MM 

Db 221 TCCAGTTTGAGGCCTTTCACGTTTTTAAGGTGTTTGTAGCCAATCCTAACAAGACGCAGC 2 80 

c ,. 866 CTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAACTCATTGAGTTTCTGAGCAGCTTCC 925 

! || I II Mill I II II Ml IMIMM M Mill II MM Ml 

Db 231 CCATCCTAGACATCCTCCTCAAGAACCAGGCCAAACTCATAGAGTTCCTCAGC£ AGTTTC 340< 

AAAAAGAAAGGACGGATGATGAGCAGTTCGCTGACGAGAAGAACTACTTGATTAAACAGA 985 

"~ j || |l 1 1 MUM- MMIII MM MUM MM III II I M MM II 

Db 341 AGAACGACAGGACGGAGGATGAGCAGTTTAACGACGAGAAGACCTATTTAGTTAAACAGA 4 0 0 

Cyr 936 TCCCAGACTTGAAGAAAACGGCCC 1009 

MM M MMIII M II I 

rfo 4 01 TCCGGGATTTGAAGAGACCCGCTC 424 



R^.SOJjT 7 

:":£-09-bo7»-7 -.11-3899 \ 
Sequence: [ v'99 t Application US/09867701 ' 
latent No. 7S20020132237A1 ? 1 

; : : " MERAL IKi'OUMATION : . - ^ 

; X7PliICANT: Aglate, - Paul A. r 
PL I CANT: Jones, Robert ;V 

• REPLICANT: Harlocker, Susan L . ■' 
: TLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY • • 

• VJTLS OF INVENTION: AND DIAGNOSIS OF OVARIAN CANCER 

r. Pill-E REFERENCE: 210121.497 . 
■ CURRENT APPLICATION ^NUMBER : US/09/867 , 701 

CURRENT FILING DATE: 2001-05-29 
; NUMBER OF SEQ ID NOS : 10912 

SOFTWARE: Fast SEQ for Windows Version 4.0 
; S2Q ID NO 5899 

LENGTH: 447 
■■• TYVjK : JNA 
; ORGANISM: Homo sapien 

US - 03 - 3S 7 -701-5899 ' 

Query Match 24.1%; Score 244.8; DB 10; Length 447; 

Best Local Similarity 77.3%; Pred. No. 5.3e-60; 

Matches 2 97; Conservative 0; Mismatches 87; Indels 0; Gaps 0 
Qv 62 6 TTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAGAATTATGTTACTAAGAGACAGTCTT 685 

1 1 1 1 II IIIMIM II Mill II II MM MM II II IIIMIM 

- b 4i TTTTCAGTGAATATGAGAAGTTACTTCATTCAGAAAATTATGTGACAAAAAGACAGTCAC 100 

0v 5 86 TAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCACAACTTTGCCATCATGACAAAGTATA 74 5 

MUM II II II II Ml II I Mill Ml I i I II 1 1 II 1 1 I ! I 

Db 101 TGAAGCTTCTCGGTGAACTACTACTAGATAGACACAACTTCACAATTATGACAAAATACA 16 0 



Qv 74 6 TCAGCAAGCCGGAGAACCTGAAACTCATGATGAACCTCCTTCGGGATAAAAGTCCCAACA 8 05 

Mil || II Ml 1 1! I! Ill I Mill Mill Ml II II II I III I Mill 

Db 161 TCAGTAAACCTGAGAACCTCAAATTAATGATGAACCTGCTGCGAGACAAAAGTCGCAACA 22 0 

n<r 80S TCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTTGTGGCCAGTCCTCACAAAACACAGC 865 ■ 

II Ml MM I 1 1 1 1 1 i 1 1 Mill MIMI MINI MM MM MM II MM 

Db 221 TCCAGTTTGAGGCCTTTCACGTTTTTAAGGTGTTTGTAGCCAATCCTAACAAGACGCAGC .2 80 

0v 866 CTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAACTCATTGAGTTTCTGAGCAGCTTCC 92 5 

Ml IIIIIIM I II II III 1 1 1 1 1 1 1 1 1 1 IMM M MM III 

Db 2 81 CCATCCTAGACATCCTCCTCAAGAACCAGGCCAAACTCATAGAGTTCCTCAGCAAGTTTC 34 0 

0v 926 AAAAAGAAAGGACGGATGATGAGCAGJTCGCTGACGAGAAGAACTACTTGATTAAACAGA 985 

I M II MIMI II Mi Mill I II -1111111111 III IT M 

Db 3 4 1 AGAACGACAGGACGGAGGATGAGCAGTTTAACGACGAGAAGACCTATTTAGTTAAACAGA 4 0 0 

Qy 986 TCCGAGACTTGAAGAAAACGGCCC 100 9 

I! I M MM III I I Ml 

T%$ 4 01 TCAGGGATTTGAAGAGACCAGCTC 4 24 

RESULT a ' 
i;n-09-867 701-4953 

.;. Sequence 4953, Application US/09867701 
; Patent NO. US20020132237A1 

; :;:-NERAL INFORMATION: • ■."■ h - ' 

• '.PPL I CANT : Aglate, Paul A. - 
APPLICANT: Jones, Robert 
APPLICANT: Har locker, Susan L. 

: TITLE OF INVENTION : COMPOSITIONS AND METHODS FOR THE THERAPY 

.' I'TTLE OF INVENTION: AND DIAGNOSIS OF OVARIAN CANCER * ; 

; : ?lhE REFERENCE: 210121.497 " - . • ■"" - 

;■ CURRENT APPLICATION NUMBER: US/ 0 9/ 367 , 70 1 . : . 

; CERENT FILING DATE: 2001-05-29 

; .-TMBER OF SEQ ID NOS : 10312 

/ SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ. ID NO 4953 
LENGTH: 450 
TYPEr DNA 

ORGANISM : Homo sapien 
US-09-867-701-4953 

' QWit^r Match 24.1%; Score 244.8; DB 10; Length 450; 

P/cnt Local Similarity 77.3%; Pred. No. 5.3e-60; 

^^tcnes/ 297; Conservative 0; Mismatches 87; Indels 0; Gaps 0; 
. 0v ' 6 9 6 TTTTTGAAGACTATGAGAAATTGCTTCAGTCTGAGAATTATGTTACTAAGAGACAGTCTT. 685. 

Mil II IIIIIIM II Mill II II IMIIIII II 1 1 IIIIIIM 

Db 2 7 TTTTCAGTGAATATGAGAAGTTACTTCATTCAGAAAATTATGTGACAAAAAGACAGTCAC 8 6 

Qv 6 s6 TAAAGCTGCTAGGGGAGCTGATCCTGGACCGTCACAACTTTGCCATCATGACAAAGTATA 74 5 

I Mill II II II II I II II MM II III I II MMIMI II I 

Db 37 TGAAGCTTCTCGGTGAACTACTACTAGATAGACACAACTTCACAATTATGACAAAATACA 14 6 

Q V 7.15 TCAGCAAGCCGGAGAACCTGAAACTCATGATGAACCTCCTTCGGGATAAAAGTCCCAACA 8 05 

MM i I M MMIMI Ml I MM I MIMI M II II II Mill Mill 

Db 147 TCAGTAAACCTGAGAACCTCAAATTAATGATGAACCTGCTGCGAGACAAAAGTCGCAACA 2 06 



nv 8 0 6' TCCAGTTTGAAGCCTTTCATGTTTTTAAGGTGTTTGTGGCCAGTCCTCACAAAACACAGC 865 

Y MIIIIMM I II I M II MIIIIMM MIMM 1 1 M MM MM II I Ml 

Db 207 TCCAGTTTGAGGCCTTTCACGTTTTTAAGGTGTTTGTAGCCAATCCTAACAAGACGGAGC 266 

„ fi66 CTATTGTGGAGATCCTGTTAAAAAATCAGCCCAAACTCATTGAGTTTCTGAGCAGCTTCC 92 5 

Qy i n | || Mill I M M Ill MIMMMI MM I _ 

Db 2G i CCATCCTAGACATCCTCCTCAAGAACCAGGCCAAACTCATAGAGTTCCTCAGCAAGTTTC 32 6 

nv 926 AAAAAGAAAGGACGGATGATGAGCAGTTCGCTGACGAGAAGAACTACTTGATTAAACAGA 985 

y ' | || II 1 1 Mill Mill I Ml III MIIIIMM III II IMIIMII 

Db 3 2 7 AGAACGACAGGACGGAGGATGAGCAGTTTAACGACGAGAAGACCTATTTAGTTAAACAGA 3 86 

Qv q 8 6 TCCGAGACTTGAAGAAAACGGCCC 100 9 

II I M I Ml II I II M I 
Db 3 87 TCAGGGATTTGAAGAGACCAGCTC 4 1 C 

RESULT 9 

US-09 -910-243-35 

; Sequence 35/ Application US/09910943 
; Patent No. US20020081610A1 
; GENERAL INFORMATION: 

APPLICANT: Hemmat i -Brivanlou , Ali 

^PLICANT: Altman, Curtis 

TITLE OF INVENTION: Assays and Materials for Embryonic Gene Expression 
;/ILE REFERENCE: 752 9/1G148US1 : f 

; CURRENT APPLICATION DUMBER: US/09/ 910 , 94.3 • 

: CURRENT FILING DATE: 2 001-07-23 - ■ 

: DUMBER OF SEQ ID NOS ..: ^42 

; SOFTWARE: PatentTn version 3,1 

; SEO ID NO 3 5 

; LENGTH: 762 ;V 
TYPE: DNA 

ORGANISM.: Xenopus laevis 
, . FEATURE : 

NAME /KEY : mis cofeature 
LOCATION: (1) - - (762) 

OTHER INFORMATION: n may be a or g or c or t/u 
U3-G9-910-943-35 

Query Match 20.8%; Score 210.8; DB 9; Length 762; 

Best Local Similarity 78.6%; Pred. No. 4.8e-50; ^ 
Matches 287; Conservative 0; Mismatches 75; Indels 3; ^aps 3 

fr . i ATGAAAAAAATGCCTTTGTTTAGTAAATCACACAAAAATCCAGCAGAAATTGTGAAAATC 60 

" 1 1 Ml MMMM Mill II II Mill MMMM II II I MM MM 

Db 3 97 ATGAAGAAAATGCCATTGTTCAGCAAGTCACATAAAAATCCGGCTGAGATTGTTAAAACT 4 56 

^ ai CTGAAAGACAATTTGGCCATTTTGGAAAAGCAAGACAAAAAGACAGACAAGGCTTCAGAA 12 0 

* Mill Mill Mill I IIMM III MMMM II II Mill II III 

Db 4 ^ 7 cTGAAGGACAACATGGCCCTGCTGGAAAGGCAGGACAAAAAAACTGAAAAGGCCTCTGAA 516 

nv 12 i GAAGTGTCTAAATCACTGCAAGCAATGAAAGAAATTCTGTGTGGTACAAACGAGAAAGAA 180 

Y MMMM Ml Ml M Mill I 1 1 1 1 1 I II II 1 1! II II I II IIMM 

Db 517 GAAGTGTCTAAATCTCTTCAAGCTACAAAAGAGATTTTGTGTGGGACAGGGGACAAAGAA 57 6 



nv -,g, ccccCAAC'AGAAGCAGTGGCTCAGCTAGCACAAGAACTCTACAGCAGTGGCCTGCTAGTG 24 0 

Y " * || | Mill I 1 1 1 1 1 M 1 1 1 1 ! 1 1 1 1 1 1 1 M I 1 1 1 1 M 1 1 111 II I I ! 

Db 5 7 7 ccTCAGACAGAGACGGTGGCTCAGCTCGCACAAGAACTGTACAACAGTGGCTTGTTGGTT 63 6 

n , r 041 ACACTGATAGCTGACC - TGCAGCTGATAGACTTTGAGGGAAAAAAAGATGTGACCCAGAT 2 99 

Q/ || | Mill Ml MM M MM Mi II Ml II IIMMII MIMI 

Db 63 7 ACTTTAATAGCCCACCTTGCATCTCATAGATTTTGANGGCAAGAAAGATGTATCTCAGAT 6 96 

Ov 3 00 ATTTAACAACATCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAG 3 59 

7 Ml II lllll MINI llllll II MINIUM I II III II Ml II II 

Db 697 ATTCNAC -AC ATCCTGAGAAAACAGATTGGC AC TCGGAGTNC - CCTGTGGAGTATATCAA 754 



Qy 3 60 TGCTC 3 64 

I I I 

Db 755 TTCCC 75 9 



RESULT 10 " 

US- : 09-954--456-1453/c ' 

; Sequ^iice 14 5 3 , Application US/09954456 ' • 

• Patent No. US20020115057A1 

; GENERAL INFORMATION: 

; APPLICANT: Young , Paul 

: TITLE OF INVENTION: Process for Identifying Anti-Cancer Therapeutic Agenns 
l:v.i.:::< Cancer Gene ^. 

: I'iiK of invention: sets " s - ; 

' :/TL.3 R^F^RENCE: 6 8 9290-76 . >! - " : ' ' 

: C ERENT APPLICATION NTTMBER: US/ 09/ 954 , 4 56 ' r - 
"; C: .RRE*TT ..TILING DATE: 2001-09-18 J> _- . 

', "-p.il IOR APPLICATION NUMBER: US/60/233 , 617 ■ • : 

; ?hfIOR FILING DATE: 2000-09-18 v . . : "' 

PRIOR APPLICATION NUMBER: UB/ 60/2 34 , 052 - v 

PRIOR FILING DATE: 2000-09-20 
; PRIOR APPLICATION NUMBER: US/60/234 , 923 
; PglOR .'FILING DATE: 2000-09-25 
; i^RTOR. APPLICATION NUMBER: US/60/235, 134 
': PRIOR FILING DATE: 2000-09-25 

; PRIOR APPLICATION NUMBER : US/ 60 /2 3 5 , 637 - - 

; PRIOR FILING DATE: 2000-09-26 

; PRIOR APPLICATION NUMBER: US/60/235 , 638 

; PRIOR FILING DATE: 2000-09-26 

; PRIOR APPLICATION NUMBER: US/60/235 , 711 

; PRIOR FILING DATE: 2000-09-27 

: PRIOR APPLICATION NUMBER: US/ 60 /23 5 , 12 0 

; PRIOR FILING DATE: 2 00 0-09-27 

PRIOR APPLICATION NUMBER: US/ 60 / 23 5 , 84 0 
' PRIOR FILING DATE: 2000-09-27 
;'. PRIOR APPLICATION NUMBER: US/ 60/23 5 , 8 63 
; PRIOR FILING DATE: 2000-09-27 
; NUMBER OF SEQ ID NOS : 2 276 

SOFTWARE: Patentln version 3.0 
; SEQ ID NO 1453 
; LENGTH: 3 87 
TYPE: DNA 

ORGANISM: Homo sapiens 
US - 0 9 - .954 -4 56-1453 



Query Match 19-2%; Score 195; DB 10; Length 387; 

Best Local Similarity 100.0%; Pred. No. l.le-45; 

Matches 195; Conservative 0; Mismatches 0; Indexs 0, Gaps 0, 



B20 TTTCATGTTTTTAAGGTGTTTGTGGCCAGTCCTCACAAAACACAGCCTATTCT 



87 9 



g ? 0 TTTCATGTTT 1 TAAUUIU mbi ^^v_™ i ^ x , 7 77,,, 

7 1 1 1 1 1 1 a 1 II 1 1 M M I II 1 1 1 1 M M M 1 II i 1 1 M M I M M I ! 1 ! i ! 

337 TTTCATGTTTTT 328 



Db 

«8C CTGTTAAAAAATCAGCCCAAACTCATTGAGTTTCTGAGCAGCTTCCAAAAAGAAAGGACG 93 9 

Db 327 iiiiiiiJJJ^ 268 

0y ,,0 GATGATGAGCAGTTCGCTGACGAGAAGAACTAOT 

1 1 1 1 1 1 1 1 1 1 1 1 1 M I II II 1 1 1 II 1 1 1 I M II 1 1 II I M M II 1 1 II 1 1 1 1 1 o 

267 C^T^ 208 • 

Qy 1000 AAAACGGCCCCTTGA 1014 

I MMII IN Ml I I 
D b 20?- AAAACGGCCCCTTGA 193 

RESULT 11 ■ ■ 1 

US-09-SSu- 107-481/c 

; Sequence *81, Application US/09880107 • 

• :?a' : ont No. US20020142981A1 \ J;'' " . 

• C777RAL INFORMATION: ' ' 

77 7JiICANT : Home, Darci T . ' " 7 

;>j-:/;LICi\KT; Vockley, Joseph G . ',\'V' 
APPLICANT ■■ Scherf , Uwe - •' ;. 

?.-opL'ICANT: Gene Logic, Inc. - ^ ' \\ 

TITLE OF INVENTION: Gene ■ Expression Profiles in Liver cancer .. . - 

; FILE REFERENCE: 44921-502 8-WO 7-7, 
•■ CURRENT APPLICATION NUMBER: ITS / 09/880 , 107 

• CURRENT FILING DATE: 2001-06-14 

' vp.IOR APPLICATION NUMBER: US 60/211,379 ■; 
; 7'17I0R FILING DATE : 2000-06-14 \i 
■ PRIOR APPLICATION NUMBER: US 60/237,054 

; 7T:I OR FILING DATE: 2000-10-02 / 
; DUMBER OF 3EQ ID NOS : 3950 
; SOFTWARE: Patentln Ve.r. 2.1 
; 3EQ ID NO 4 81 
; LENGTH: 3 87 
TYPE : DNA 

ORGANISM: Homo sapiens 
a TURK • 

'. OTHHfv INFORMATION: Genbank Accession No. US20020142 981A1 AA234362 
ui3-09 ->f0 -107-481 

Cuerv Match .19.2%; Score 195; DB 10; Length 3 87; 

yest bocil Similarity 100.0%; Pred. No. l.le-45;- ' ' . 

Matches 195; Conservative 0; Mismatches 0; Indels 0; Gaps 0, 



820 TTTCATGTTTTT AAGGTGTTTGTGGCCAGTCCTCAC B79 
Db 3 37 UtCATgUtTTaAgGTGTTTGTGG^ 328 



Qy 38 0 cTGTTAAAAAATCAGCCCAAACTCATTGAGTTTCTGAGCAGCTTCCAAAAAGAAAGGACG 93 9 



II I M III || I II I II 1 1 MM II I! Ill 1 1 MUM I II II llllll M I M I II IN I 

Db 327 ctGTTAAAA^ 268 

Ov 940 GATGATGAGCAGTTCGCTGACGAGAAGAACTACTTGATTAAACAGATCC^ 999 

y 1 1 1 1 1 1 1 1 1 1 1 ; 1 1 1 1 1 1 1| || 1 1 M 1 1 II M I II M I M I II M II 1 1 1 1 II II I M M 

Db . 267 GATGATGAGCAGTTCGCTGACGAGAAGA^ 208 

0v 1000 AAAACGGCCCCTTGA 1014 

I II II M M I I M II 
Db 2 07 AAAACGGCCCCTTGA 193 

RESULT 12 

US -10-257-82 6 A - 1 1 3 

; Sequence 113, Application US/10257826A 

• Publication No. US20030181407A1 

• GENERAL INFORMATION: 

; APPLICANT: SA MAJESTE LA REINE DU CHEF DU CANADA 
; APPLICANT: PALIN, Marie-France 
; APPLICANT: POMAR , Candido 
; APPLICANT: GARIEPY, Claude 

; TITLE OF INVENTION: Steatosis -modulating factors and uses 

TITLE OF INVENTION: thereof 
•. FILE REFERENCE: 14654 -2US 

CURRENT APPLICATION NUMBER: US/10/257 , 326A .■ " 

ojRRENT FILING DATE: 2002-10-17 

; >R10R APPLICATION NUMPER: 60/197936 

• PRIOR FILING DATE: 2000-04-17 

' TRIOR APPLICATION NUMBER: PCT/CA01/00509 
; -i'.RICR FILING DATE : 2001-04-12 

■:^jtf&m OF SEQ ID NOS : 3 05 " ''"'. V 

; ^'.^'TWARE: Fast SEQ for Windows Version -4 . 0 

; t d no 118 • " 1 

: LENGTH : 722 

:.YPE: DNA 
; ORGANISM: Artificial Sequence 

FEATURE : 

OTHER INFORMATION: Artificial sequence 
; OTHER V ^FORMATION : Muscular steatosis ■ 

• OTHER INFORMATION: Porcine 
FEATURE : 

NAME/ KEY: misc _f eature 

• LOCATION: (1) ... (722) 

OTHER INFORMATION: n = A,T,.C or G 
US-10-2S7-B26A-118 

. Que3 V, Match 16.7%; Score 169.8; DB 13; Length 722; , v • 

Best" Local Similarity 60.1%; Pred. No. 3.1e-38; _ 
Matches 303 ; Conservative 0; Mismatches 196; Indels 5; Gaps... 4 

347 XGGAGTATATTAGTGCTCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCC 406 

" IMIII Ml I i I I 1 1 I ! 1 1 i ! MM INI 

Db 3 TGGTGAATNCCTCTGCCCCCACNGAATTTTTGGTCATGGTANTNGAAGGGGATNAATNTT 67 

Ov AQ7 CACAGATTGCCTTACGTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACC 466 

' * I I in | | Ml II II Ml I Ml MM II III I Ml MM 

Db 68 CCGAAATTNCNNTTAATTGGGGNATNATGGTNAGANAATGCCTTNGACCTNNACCGCTTG 127 



Qy 
Db 



\S1 CCAAAATCATCCTCTTTTC- -TAATCAATTCAGAGATTTCTTTAAGTACGTGGAGTTGTC 524 

MMMIIII I I I Mill Ml MM I M 1 1 I Ml 

12 8 CCAAAATCATTTTGNGGGCCGAACACAGTTTATAGAGATCTTCACATATGTCTAAATGTN 187 
Pv 5 op « AACA-TTTGATATTGCTTCAGATGCCTTTGCTACTTTCAAGGATTTACTAACCAGACATA 583 

! M Ml I M III I II I II I Ml I 1 1 II I I MINI 

Db 13 8 ANCATTTTNACATATCTTTACATNCNNTTNCCNCATTTTNNGNNTTACTTTCACGACATA 24 7 

rtv 5 8 4 AAGTGTTGGTAGCAGACTTCTTAGAACAAAATTACGACACTATTTTTGAAGACTATGAGA 643 

| || I II I II I M MM M Mill 

2 1 8 TATTGCTCACNGCGCAANTTTTGGAACANCATTATGATANATTTTTCAGTGAATATGATG 3 07 
i'A ' AATTGCTTCAGTCTGAGAATTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGAGC 7 03 

|| MUM MM MMMM I I- Mill II I Mill II II II I 

3 0 8 AAGNGCTTCATTCTTAAAATTATGTGGCCACAAGACAATCACTGAAGCTTCTCGGNGAAC 367' 
7 04 TGATCCTGGACCGTCACAACTTTGCCATCATGACAAAGTATATCAGCAAGCCGGAGAACC 76 3- 

I Ml I I I I Ml! Ml i Mill II Ml II II I I Ml 

Db . 368 T AC T A C T ANAT AG ACN CN AC TTCN C CAN T ATG AC C A C AT AC CT C ATT AAAC C TGNGNAC C 427 

Qv 764 TGAAACTCATGATGAACCTCCTTCGGGATAAAAGTCCCAA-CATCCAGTTTGAAGCCTTT 82 2 

| | | 1 1 1 1 II II I II I ! 1 1 I i M 1 1 1 1 1 I II II II II I I I M 

Db ;, 2 a T- CCATTAATGATGAACCTGCCTGCAGACAAAAGTCGGAACCTTCCANTTTGAGGGCTTN 4 8 b. 



Db 



Db 



■ \; ". C ATGTTTTTAAGGTGTTTGTGGCC 34b 

M MMMM I I Ml I 

4 8/ v ,'ACGTTTTT A. ANGGGGNTGTNNNC 510 



T ifji - ?. 0 - 2 f) 1 -'■'■> 2 b A- 119 

; Sequel Lice. 119, Application US / 1025782 6 A 

• Publication No. US2C030131407A1 
; GENERAL INFORMATION: 

; APPLICANT: SA MAJESTE LA REINE DU CHEF DU CANADA 
; APPLICANT: PALIN, Marie-France 

APPLICANT: POMAR, Candido 

APPLICANT: GARIEPY. Claude 
; TITLE OF INVENTION: Steatosis -modulating factors and uses 

TITLE OF INVENTION: thereof 

FILE REFERENCE: 14654 -2U.S 
; CURRENT APPLICATION NUMBER: US/10/257 , 326A 
■ CURRENT FILING DATE: 2002-10-17 
; PRIOR APPLICATION NUMBER: 60/197936 

• PRIOR FILING DATE: 2000-04-17 

; PRIOR APPLICATION NUMBER: PCT/CAO 1/ 005 0 9 
; PRIOR FILING DATE: 2001-04-12 
; NUMBER OF SEQ ID NOS : 3 05 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 119 

LENGTH : 7 0 0 

TYPE : DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Artificial sequence 
OTHER INFORMATION: Muscular steatosis 



OTHER INFORMATION: Porcine 
FEATURE : 

NAME /KEY : misc_ feature 

• LOCATION: (1) . - . (700) 

• OTHER INFORMATION: n = A,T,C or G 
U3-1C-257-826A-119 

QM«y Match 16.4%; Score 166.6; DB 13; Length 700; 

Best Local Similarity 60.1%; Pred. No. 2.6e-37; 

Matches 304; Conservative 0; Mismatches 197; Indels 5; Gaps 4, 
34 5 TGTGGAGTATATTAGTGCTCATCCTCATATCCTGTTTATGCTCCTCAAAGGATATGAAGC 4 04 

I ill I II |M | I I I II I III' I I III II II 

Qb s TCTGGTGAATCCCTCTGCCCCCACNGAATTTTTGGTCATGGTANTNGAAGGGGATNAAT>I 65 

r,- '-05 cCrACAGATTGCCTTACGTTGTGGGATTATGCTGAGAGAATGTATTCGACATGAACCACT 464 

\ Mill I 1 1 1 I i 1 1 1 1 1 I I M 1 1 1 1 II Ml I Ml II. 

Db 66 TTCCGAAATTTCGATTAATTGGGGNATNATGGTNAGANAATGCCTTNGACCTCCACCGCT 12 5 

* 6 5 TGCCAAAATCATCCTCTTTTC - - TAATCAATTCAGAGATTTCTTTAAGTACQTGGAGTTG 5 2 2 

I II I I I I I I I I I I I | II II I III II I I I II II I H 

Db 126 TGCCAAAATCATTTTGNGGGCCGAACACAGTTTATAGAGATCTTCACATATGTCTAAATG 185. 

V -3 TCAACA-TTTGATATTGCTTCAGATGCCTTTGCTACTTTCAAGGATTTACTAACCAGACA 58>U 

" I | l| | | | I M Ml I M I III Ml I Mill I I I II 
; ,-. 0 TNANCATTTTNACATATCTTTAC!ATNCNNTTNCCNCATTTTNNGNNTTACTTTCACGACA 2 4 5 . 

I'AA^G'rGTTGG'rAGCACvACTTCTi'AGAACAAAATTACGACACTATTTTTGAAGAOTATGA 5«£ 

Mill! M I I M Mill MM III 1 1 1 1 M II Ml - 

o TATATTGCTCACNGCGCAANTTTTGGAACANCATTATGATANATTTTTCAGTGAATATGA 3 05. 
:,. i n GAAATTGCTTCAGTCTGAGAATTATGTTACTAAGAGACAGTCTTTAAAGCTGCTAGGGGA 7 0 1 

II MM II III I II M I Ml I ! Mill II I Illli II II II ■ 

06 tgAaGNGCTTCATTCTTAAAATTATGTGGCCACAAGACAATCACTGAAGCTTCTCGGNGA 365. 



.0 



Cv 702 -.^TGATCCTGGACCGTCACAACTTTGCCATCATGACAAAGTATATCAGCAAGCCGGAGAA 761 

7 II IM I I I I MM 1 1 1 I II M I II Ml .11 IN I I 

366 ACTACTACTANATAGACNCNACTTCNCCANTATGACCACATACCTCATTAAACCTGNGNA 425 
Cy 762 CCTGAAACTCATGATGAACCTCCTTCGGGATAAAAGTCCCAA-CATCCAGTTTGAAGCCT 820 

Ml I I II III II I III I I II II Ml II II I MM Mill I M 

Db 4n6 CCT - CCATTA ATGATGAACCTGCCTGCAGACAAAAGTCGGAACCTTCCANTTTGAGGGCT 4 84 

c ,_ g21 XTCATGTTTTTAAGGTGTTTGTGGCC 846 

I II II III IN I i IN I 

r , b 485 tncACGTTTTTAANGGGGNTGTNNUC 510 



RESULT 14 
UC-09-770--445-592 

; Sequence 592, Application US/09770445 
; Patent No. US20020023281A1 
; GENERAL INFORMATION: 

APPLICANT: Gorlach, Jorn 

APPLICANT: An, Yong-Qiang 
; APPLICANT: Hamilton, Carol M. 
; APPLICANT: Price, Jennifer L. 



7\ nriT T r*7\ "NTT • 


Raines , 


Tracy M. 


7\ n fjt TPA "NTT 1 • 


Yu , Yang 




7\ DDT 7 r i Z\TvTT • 


Rameaka, 


Joshua G. 


APPJ-iX^AiMl . 


Page , Amy 


t\ nriT TO A "NTT 1 • 
APPJ-i J-V-AN 1 . 


Matthew, 


Abraham V. 


Tt t\x-vx TP AMI 1 . 

APPblL AN 1 : 


Ledford, 


Brooke L . 


ft nnT T A TvTT 1 . 

APPJj1\-AJN 1 . 


Woessner 


, Jeffrey P. 


APPJ-i J-L-AJN 1 . 


Haas, Wi 


lliam David 


APPLICANT: 


Garcia, 


Carlos A. 


APPLICANT: 


Kricker, 


Maja 


APPLICANT: 


Slader , 


Ted 


APPLICANT : 


Davis, Keith R. 


APPLICANT: 


Allen, Keith 


APPLICANT: 


Hoffman, 


Neil 


APPLICANT: 


Hurban , 


Patrick 


TITLE OF I 


NV.5NTI0N : 


Expressed £ 



TITLE OF INVENTION : thaliana 

FILE REFERENCE: 2023US (PARA- 012PRV) 

CURRENT APPLICATION NUMBER: US/09/770 , 445 

CURRENT FILING DATE: 2.001-01-26 

PRIOR APPLICATION NUMBER: US 60/178,472 

PRIOR FILING DATE: 2000-01-27 

NUMBER OF SEQ ID NOS : 399 

SOFTWARE: FastSEQ for Windows Version 4.0 
CEQ ID NO 5 92 . 
LENGTH: £61 • - *' 

TYPE: DNA 

ORGANISM: Arabidopsis thaliana : * . • • : • 

-.,'3 -7 70- 44 5-S92 '- ' " 

. ;):Vr ; ry Macch 15.4%; Score 156; DB 9; Length 361; ' ■■ 

St focal Similarity 55.8%; Pred. No. 3.4e-34; ' " 

vetches ?19; Conservative 0; Mismatches 250; Indels 3, Gaps 1; 

3 94 GGATATGAAGCCCCACAGATTGCCTTACGTTGTGGGATTATGCTGAGAGAATGTATTCGA 453" 



Mil || | II -II IMI I III I MM I Mill M I 
rib 12 GGGTTTG 



AAAACACCGATATGGCGTTACACTATGGTACTATGTTTAGAGAGTGCATCCGT 7 1 



x54 CATGAACCACTTGCCAAAATCATCCTCTTTTCTAATCAATTCAGAGATTTCTTTAAGTAC 513 

"" Ml | MM III I I II IN I i ! II III I III 

Db 72 CATCAGATTGTTGCAAAATATGTTTTGGACTCGGAGCACGTGAAGAAGTTTTTTTACTAC 131 

14 GTGGAGTTGTCAACATTTGATATTGCTTCAGATGCCTTTGCTACTTTCAAGGATTTACTA 5 7.3 

" | h i I I || || t I I I I i I Mill III Mill Mill IM 

1 3 2 ATACAGCTTCCCAATTTCGACATTGCTGCTGATGCTGCTGCAACTTTTAAGGAACTTCTG 191 
CATAAAGTGTTGGTAGCAGACTTCTTAGAACAAAATTACGACACTATTTTTGAA 63 3 



Qy 



Ov v-74 ACCAGAI 

II II II II II M II \\__\_ 

Db 192 ACAAGG' 



CACAAGTCTACAGTTGCTGAGTTTCTCATTAAGAATGAAGACTGGTTTTTTGCA 251 



n, ■ 6 3- GACTATGA - - - GAAATTGCTTCAGTCTGAGAATTATGTTACTAAGAGACAGTCTTTAAAG 690 

7 " Mill I II I III I II MUM IMI I II II I Ml 

Db 252 GACTACAACTCAAAGCTTCTTGAATCAACTAATTATATTACCCGACGGCAAGCTATTAAG 311 

Ov 691 CTGCTAGGGGAGCTGATCCTGGACCGTCACAACTTTGCCATCATGACAAAGTATATCAGC 750 

V * II I II II I I MM I III II I IMM MIMI I III 

Db 3 12 TTGTTGGGCGATATATTATTGGATAGGTCAAATTCGGCTGTGATGACGAAGTATGTGAGC 3 71 



Db 

Qy 

Eb 



nv 7C1 aagcCGGAGAACCTGAAACTCATGATGAACCTCCTTCGGGATAAAAGTCCCAACATCCAG 810 

I in mi mi I 1 1 1 ! ! 1 1 1 1 I i II I III I MIM 

Db 372 TCGATGGA.TAACTTGAGGATTCTGATGAATCTTCTCAGAGAATCAAGCAAGACTATTCAG 431 

Q/ 811 TTTGAAGC CTTTC ATGTTTTTAAGGTGTTTGTGGCC AGTCCTC AC AAAAC ACAGC CTATT 870 

432 ATAGAAGCTTTCCATGTTTTCAA 491 

871 GTGGAGATCCTGTTAAAAAATCAGCCCAAACTCATTGAGTTTCTGAGCAGCTTCCAAAAA 93 0 

I I II Ml I III IN M I ii 11 11 ' 

4 92 GCCAACATTCTGGTGGCAAACAGAAACAAGCTTCTGAGATTGTTGGCTGATATCAAGCCG 551 

Ov 93 J GAAAGGACGGATGATGAGCAGTTCGCTGACGA 962 

III 1 1 1 1 1 I 1 1 i M I I 

Db 552 GACAAAGAGGACGAGAGGTTTGACGCAGACAA 5 83 

RESULT 15 ' ' ,; ' . 

US-09-'>23 -o76-1251 • • - 

; Sequencs 1251, Application US/09923876 / ' 

; Patent Mo. US20020013958A1 

,- general information-. 

• ; .'PLICAnT: Lalgudi, Raghunath V. 
APPLICANT: Kamigaki , Laura Y. (Ito) 

: itS^iavSSST'^SSS^™- ^.PEPTIDES DERIVED EEOK =0R» 

r:::";y;;. s iNG . ■ . ■ ' ' " 

: TLK REFERENCE: L'L-0 012-1 CON ^ . 

: . RENT APPLICATION NUMBER: US/09/923,876 ' ' 

. RENT FILING DATE: 2001-08-06 

; ^ffefoft- APPLICATION NIJMBGR: 09/298,329 N 

• . .;-rOR FILING DATE: 1 999 -04-21 - , - 
■ PRIOR APPLICATION NUMBER: 60/085,331 ! . 

; PRIOR FILING DATE: 1998-05 05 
; NUMBER OF SEQ ID HQS : 63 32 
; SOFTWARE : PERL Program 

; SBO ID NO 12 51 ; t , 

; LTNCTH : 2 62 './ . 

;' TYPE: DNA 

ORGANISM: Zea mays 
; FEATURE: 

v "-ME /KEY : misc feature 
; OTHER INFORMATION : Incyta ID No. US20020013958A1 700158378H1 

; NAM3/.I05Y: unsure 
• LOCATION : 148 

, ' OTHER INFORMATION: a, t, c, g, or other 
US-09-923-876-1251 

Qur-rw Match 7.3%; Score 74.2; DB 9; Length 262; ■ • 

*est r Local Similarity 55.5%; Pred. No. 6.4e-ll; ■ - 

Matches- 142; Conservative 0; Mismatches 114; Indels 0; Gaps 0 

nv 311 TCTTGAGAAGACAGATAGGCACTCGGAGTCCTACTGTGGAGTATATTAGTGCTCATCCTC 370 

I I I I I I I I I I I I I I II M I I I I I I 1111 1 

Db 7 TGTTGAGACAGAAGGTTGATGAAAGCTATTGTTGCGTCCAGTATATTGAAAATCATTTTG 66 



Ov '71 ATATCCTGTTTATGCTCCTCAAAGGATATGAAGCCCCACAGATTGCCTTACGTTGTGGGA 43 0 

mi II | I 1 1 I I I ! I ! I I I 1 1 1 1 1 1 1 1 1 1 I 

•~ 7 ATC^TTTGGAT^TCCTTGTTGTTTGCTATAAGAACTTGGAAGTCGCGTTGAATTGTGGAA 126 



Db 



*->1 TTATGCTGAGAGAATGTATTCGACATGAACCACTTGCCAAAATCATCCTCTTTTCTAATC 490 

III II II Mill M I I > I I II Ml Ml M I Ml 

127 ACATGTTGCGAGAATGCATAANATATCCTACACTTGCAAAATATATATTGGAGTCAAGCA 186 

4-3- AATTCAGAGATTTCTTTAAGTACGTGGAGTTGTCAACATTTGATATTGCTTCAGATGCCT 550 

Ml || II Mil II II lllllll M MIIIIIMI Mill 

187 GCTTCGAGTTGTTTTTCCAGTATGTTGAATTGTCAAACTTCGATATTGCATCTGATGCTC 246 



Db 

Qy 

Db 

Qy 



U=>i TT3CTACTTTCAAGGA 566 

I 1 1 1 1 M I M II 

cb 247 TGAAC ACTTTCA AGGA 2 62 



Search completed: January 6, 2004, 05:04:45 

Job ti-ss : 1 '00 sees 



